The invention provides for a chemical method for preparing a recombinant single copy polypeptide or a portion thereof
with a modified terminal amino acid α-carbon reactive group selected from the group consisting of N-terminal α-amine,
C-terminal α-carboxyl, and a combination thereof. The steps of the method involve forming the recombinant single copy
polypeptide or a portion thereof so that the single copy polypeptide is protected with one or more biologically added
protecting groups at the N-terminal α-amine and/or the C-terminal α-carboxyl. The recombinant single copy polypeptide
can then be reacted with up to three chemical protecting agents to selectively protect reactive side chain groups and
thereby prevent side chain groups from being modified. The recombinant single copy polypeptide can be cleaved with at
least one cleavage reagent specific for the biological protecting group to form an unprotected terminal amino acid
α-carbon reactive group. The unprotected terminal amino acid α-carbon reactive group is modified with at least one
chemical modifying agent. The side chain protected terminally modified single copy polypeptide is then deprotected at
the side chain groups to form a terminally modified recombinant single copy polypeptide. The number and sequence of
steps in the method can be varied to achieve selective modification at the N- and/or C-terminal amino acid of a
recombinantly produced polypeptide.

METHOD FOR MODIFICATION OF
RECOMBINANT POLYPEPTIDES

Background of the Invention

Many naturally occurring proteins and peptides
have been produced by recombinant DNA techniques.
Recombinant DNA techniques have made possible the
selection, amplification and manipulation of expression
of the proteins and peptides. For example, changes in
the sequence of the recombinantly produced proteins or
peptides can be accomplished by altering the DNA
sequence by techniques like site-directed or deletion
mutagenesis.

However, some modifications to a recombinantly
produced protein or peptide cannot be accomplished by
altering the DNA sequence. For example, the C-terminal
α-carboxyl group in many naturally occurring proteins and
peptides often exists as an amide, but this amide
typically is not produced through recombinant expression.
In vivo, it is formed biologically after expression by
conversion of a precursor protein to the amide. Another
example is the addition of a D-amino acid to the N- and/or
C-terminal end of a recombinantly produced protein or
peptide.

In addition, it may be desirable to selectively
modify both the N- and C-terminal α-carbon reactive
groups of a recombinantly produced protein or peptide.
Recombinantly produced proteins or polypeptides have a
multiplicity of reactive side chain groups, as well as
the N- and C-terminal amino acid α-carbon reactive
groups. Side chain reactive groups include thiols,
carboxyls, imidazoles, and ε-amine reactive groups.
Selective modifications at the N- and/or C-terminal
α-carbon reactive groups, such as adding an N-terminal
pyroglutamyl residue and/or forming an amide at the
C-terminal amino acid, need to be conducted without
adversely affecting the reactive side chain groups.

A method of forming a C-terminal amide on a
recombinantly produced polypeptide by the action of an
enzyme is known. The enzyme is peptidyl glycine
α-amidating monooxygenase and is present in eukaryotic
systems. The enzyme has been used to form an amide on
the C-terminal amino acid of recombinantly produced
peptides, like human growth hormone releasing hormone, in
vitro as described by J. Engels, Protein Engineering,
1:195-199 (1987).

In addition, many recombinantly produced small
proteins and peptides have a limited number of reactive
side chain groups. For example, the 27 amino acid human
gastrin releasing peptide contains N-terminal α-amine
and side chain hydroxyl and ε-amine reactive groups.
The myosin light chain kinase inhibitor contains 10
amino acids and has terminal α-amine and side chain
amine reactive groups. The C-terminal α-carboxyl groups
are amidated in both of these naturally occurring
peptides. Although these types of small proteins and
peptides have a limited number of different reactive
groups, they have been amidated through the traditional
method of enzymatic C-terminal amidation. While
selective, the enzymatic method is time consuming,
expensive, gives unpredictable yields, and requires
significant post reaction purification. The enzymatic
method is also limited to modifying the recombinantly
produced peptide by C-terminal amidation.

Accordingly, there is a need for a chemical
method that provides for selective modification of
either or both N-terminal α-amine and C-terminal
α-carboxyl groups of a recombinantly produced
polypeptide. Such a method would result in selective
modifications to one or both terminal amino acid
α-carbon reactive groups without adversely affecting
the reactive side chain groups. There is also a need
for a method of selective modification that allows
addition of a variety of different organic moieties to
the N- and/or C-terminal α-carbon reactive groups of a
recombinantly produced polypeptide and that is
convenient, cheap and capable of producing terminally
modified recombinant polypeptides in high yield.
Therefore, it is an object of the invention to develop a
chemical method for selective modification of N-terminal
α-amine and/or C-terminal α-carboxyl reactive groups of
a recombinantly produced polypeptide.

Summary of the Invention

These and other objects are accomplished by the
present invention. The invention provides for a
chemical method for preparing a recombinant single copy
polypeptide or portion thereof with a modified terminal
amino acid α-carbon reactive group selected from the
group consisting of an N-terminal α-amine, C-terminal
α-carboxyl and a combination thereof. The recombinant
single copy polypeptide also has reactive side chain
groups selected from the group consisting of an ε-amine
group, a hydroxyl group, a β-carboxyl group, a
γ-carboxyl group, a thiol group, and a combination thereof.

The steps of the method involve forming the
recombinant single copy polypeptide or a portion thereof
so that the single copy polypeptide is protected with
one or more biologically added protecting groups at the
N-terminal α-amine and/or the C-terminal α-carboxyl.
The recombinant single copy polypeptide is then reacted
with up to three chemical protecting agents to
selectively protect reactive side chain groups to form a
side chain protected recombinant single copy polypeptide
and thereby prevent the side chain groups from being
modified during the modification reaction. The
recombinant single copy polypeptide is cleaved with at
least one cleavage reagent specific for the biologically
added protecting group to form a recombinant polypeptide
with an unprotected terminal amino acid α-carbon reactive
group. Alternatively, the single copy polypeptide can
be cleaved with at least one cleavage reagent specific
for the biological protecting group followed by reaction
with up to three chemical protecting agents. In either
case, a side chain protected single copy polypeptide
having an unprotected terminal amino acid α-carbon
reactive group is produced. The unprotected terminal
amino acid α-carbon reactive group is then modified with
at least one chemical modifying agent. The resulting
side chain protected terminally modified single copy
polypeptide is then deprotected at the side chain groups
to form a terminally modified recombinant single copy
polypeptide.
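
The two alternative step orders described above can be illustrated with a small bookkeeping sketch. It is not part of the disclosed method: the class `Polypeptide`, its fields and the four step functions are hypothetical names chosen for illustration, only the N-terminal case is modeled, and the sketch assumes a modification that does require side chain protection.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Polypeptide:
    """Toy state of a biologically protected single copy polypeptide (hypothetical model)."""
    n_terminus_blocked: bool = True       # biological protecting group still attached
    side_chains_protected: bool = False   # chemical protecting groups on side chains
    n_modification: Optional[str] = None  # moiety added to the N-terminal alpha-amine
    history: List[str] = field(default_factory=list)


def protect(p: Polypeptide) -> Polypeptide:
    """React with chemical protecting agents (side chains only in this sketch)."""
    p.side_chains_protected = True
    p.history.append("protect")
    return p


def cleave(p: Polypeptide) -> Polypeptide:
    """Remove the biological protecting group, freeing the terminal alpha-amine."""
    p.n_terminus_blocked = False
    p.history.append("cleave")
    return p


def modify(p: Polypeptide, moiety: str) -> Polypeptide:
    """Modify the free terminus; refuse if the preconditions in the text are not met."""
    if p.n_terminus_blocked:
        raise ValueError("terminal alpha-amine is still biologically protected")
    if not p.side_chains_protected:
        raise ValueError("side chain groups are unprotected and could be modified")
    p.n_modification = moiety
    p.history.append("modify")
    return p


def deprotect(p: Polypeptide) -> Polypeptide:
    """Regenerate the original side chain reactive groups."""
    p.side_chains_protected = False
    p.history.append("deprotect")
    return p


# Either order of protecting and cleaving reaches the same modifiable intermediate.
protect_first = deprotect(modify(cleave(protect(Polypeptide())), "acetyl"))
cleave_first = deprotect(modify(protect(cleave(Polypeptide())), "acetyl"))
print(protect_first.history)
print(cleave_first.history)
```

The check inside `modify` mirrors the requirement that the terminus be unprotected and the side chains protected before the modifying agent is applied.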

The recombinant single copy polypeptide or
portion thereof is formed with one or more biologically
added protecting groups on the terminal amino acid
α-carbon reactive groups. The biologically added
protecting group can be a peptide, a polypeptide, an amino
acid, or a combination thereof connected to the N-
and/or C-terminal α-carbon reactive groups by an amide
bond connection. Because this amide bond is stable and
generally irreversible, the biological protecting group
contains at least one recognition sequence that is cleavable
enzymatically or chemically. The recombinant
polypeptide with one or more biologically added
protecting groups is formed by incorporating the DNA
sequence for the biologically added protecting group or
groups into the expression cassettes adjacent to the
sequence for the recombinantly produced protein or
peptide.

For example, the recombinant single copy
polypeptide can be formed as a single copy fusion
protein. The single copy fusion protein has a binding
protein connected via an interconnecting peptide to the
single copy polypeptide at the N- and/or
C-terminal α-carbon reactive group. The interconnecting
peptide has at least one site that is cleavable by a
chemical or enzymatic reagent and serves as a biological
protecting group. The binding protein and
interconnecting peptide not only serve as a biological
protecting group, but also aid in purification of the
recombinant single copy polypeptide. For example, a
single copy fusion protein having a binding protein of
carbonic anhydrase and a polypeptide of any peptide
sequence can be purified through use of an immobilized
reversible inhibitor such as benzene sulfonamide.
Further, the carbonic anhydrase can be modified to
eliminate cleavage sites that would otherwise be cleaved
along with the interconnecting peptide. In
a preferred embodiment, two cleavage sites can be
incorporated within the interconnecting peptide so that
after purification of the fusion protein, the binding
protein can be cleaved to leave a short peptide sequence
(e.g., the interconnecting peptide) as the biological
protecting group for the single copy polypeptide. This
demifusion protein can be modified according to the
invention to protect its reactive side chain groups.
The short peptide sequence residue acts as the
biological protecting group of the N-terminal α-amine of
the demifusion protein. Enzymatic or chemical cleavage
of this short peptide sequence releases the free
N-terminal α-amine for further modification according to
the invention.

The recombinant single copy polypeptide can
also be formed having only a portion of the amino acid
sequence of the desired polypeptide or as a truncated
version of the polypeptide. Preferably, the portion of
the sequence is lacking from about 1 to about 10 of the
terminal amino acids of the polypeptide. The portion of
the recombinant single copy polypeptide is formed so
that it is biologically protected at the N- and/or
C-terminal end with a polypeptide, peptide, or amino
acid as described above. The portion of or truncated
version of the single copy polypeptide can also be
formed as a multicopy polypeptide or fusion protein.

The starting material of the invention can also
be recombinantly formed as a multicopy polypeptide or
fusion protein. The multicopy polypeptide has several
copies of the single copy polypeptide tandemly linked
together with or without an intraconnecting peptide. If
an intraconnecting peptide is present, it has at least
one site that is selectively cleavable by a chemical or
enzymatic cleavage reagent. The intraconnecting peptide
also acts as a biological protecting group at the
C-terminal portion of one or more single copy
polypeptides incorporated into the multicopy
polypeptide. A multicopy fusion protein has three
tandemly linked segments including a binding protein
connected via an interconnecting peptide to the
multicopy polypeptide. The interconnecting peptide has
at least one site that is selectively cleavable by a
chemical or enzymatic method and is preferably different
from the intraconnecting peptide. The binding protein
with interconnecting peptide acts as a biological
protecting group and aids in the purification of the
recombinant multicopy polypeptide. In a preferred
multicopy embodiment, like the embodiment described above
for the single copy fusion protein, the multicopy
polypeptide can have carbonic anhydrase as a binding
protein. The carbonic anhydrase can be modified so
that it does not contain cleavage sites which are to be
used in both the interconnecting peptide and the
intraconnecting peptide. The interconnecting peptide
preferably contains at least two cleavage sites. After
separation and purification through use of the binding
protein, the binding protein fragment is removed by
cleavage at a unique cleavage site within the
interconnecting peptide. Separation of the binding
protein fragment from the multicopy polypeptide and side
chain protection according to the invention produces a
protected multicopy polypeptide ready for cleavage into
single copies and release of the free N-terminal
α-amine. Selection and addition of the appropriate
enzyme, enzymes or chemicals for cleavage of the
biological protecting group and/or the intraconnecting
peptides releases the free α-amine or free α-carboxyl
group of the several copies of the desired polypeptide.
The protected polypeptide can then be modified at the
N-terminal or C-terminal or both as desired.

The starting materials of the invention are
selected and recombinantly produced with biologically
added protecting groups. The starting materials can
include a biologically protected recombinant single copy
polypeptide or portion thereof, a recombinant single
copy fusion protein, a recombinant multicopy fusion
protein, and a biologically protected recombinant
multicopy polypeptide. The preferred starting material
is a recombinant single or multicopy fusion protein.

Once the starting material of the invention is
selected and formed, the starting material is treated to
produce a protected single copy polypeptide having an
unprotected terminal amino acid α-carbon reactive group.
The starting material is reacted with up to three
chemical protecting agents to form a side chain
protected molecule to prevent reaction of side chain
reactive groups with the modification agent. The
starting material is cleaved with a cleavage reagent
specific for the biologically added protecting group to
form an unprotected terminal amino acid α-carbon
reactive group. The number and sequence of steps of
cleaving and reacting the starting material with up to
three chemical protecting agents can vary depending on
several factors, including:

(a) if the starting material of the invention
is a multicopy polypeptide or fusion protein, extra
cleavage steps can be required;

(b) if the modification desired is at the N-
and/or C-terminal α-carbon reactive group, extra
cleavage and modification steps are required;

(c) the amino acid sequence of the desired
polypeptide, the number of side chain reactive groups,
and whether a cleavage recognition sequence is present
will influence whether the polypeptide is protected
first or cleaved first; and

(d) the type of modification - for example,
some types of modification reactions do not require
protection of side chain reactive groups.
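
How factors (a) through (d) change the step list can be sketched with a deliberately simplified planner. The function `plan_steps`, its parameters and the step labels are hypothetical and are not taken from the specification. The planner only orders generic step names, and it treats the protect-first versus cleave-first choice of factor (c) as an input rather than deriving it from a sequence.

```python
from typing import List, Sequence


def plan_steps(
    starting_material: str,
    termini: Sequence[str],
    protect_side_chains: bool = True,
    protect_first: bool = True,
) -> List[str]:
    """Return an ordered list of generic step names (illustrative assumption only)."""
    steps: List[str] = []

    # Factor (a): a multicopy fusion protein needs an extra cleavage step
    # to remove the binding protein at the interconnecting peptide.
    if starting_material == "multicopy fusion protein":
        steps.append("cleave interconnecting peptide")

    # Factor (d): some modifications need no side chain protection.
    protection = ["protect side chains"] if protect_side_chains else []

    # Factor (b): each terminus to be modified adds a cleave/modify pair.
    first, *rest = termini
    first_cleave = f"cleave {first}-terminal biological protecting group"

    # Factor (c): protect first or cleave first.
    if protect_first:
        steps += protection + [first_cleave]
    else:
        steps += [first_cleave] + protection
    steps.append(f"modify {first}-terminus")

    for terminus in rest:
        steps.append(f"cleave {terminus}-terminal biological protecting group")
        steps.append(f"modify {terminus}-terminus")

    if protect_side_chains:
        steps.append("deprotect side chains")
    return steps


for step in plan_steps("single copy polypeptide", ["N", "C"]):
    print(step)
```

With a single copy starting material and both termini requested, the planner lists protection, N-terminal cleavage and modification, C-terminal cleavage and modification, and deprotection, in that order.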

The number and sequence of cleaving and
reacting steps are selected to achieve a protected
single copy polypeptide having an unprotected terminal
α-carbon reactive group. For example, a recombinant
multicopy fusion protein can be terminally modified as
follows. The recombinant multicopy fusion protein is
recombinantly formed having a binding protein connected
to an interconnecting peptide which is connected to the
N- or C-terminal end of the multicopy polypeptide. The
multicopy polypeptide has several copies of the single
copy polypeptide connected with intraconnecting
peptides. The interconnecting peptide and
intraconnecting peptide act as biological protecting
groups and each have at least one chemical or enzymatic
cleavage site. The multicopy fusion protein is first
cleaved with cleavage reagents specific for the
interconnecting peptide to form a multicopy polypeptide.
The multicopy polypeptide is then reacted with up to
three chemical protecting agents to protect reactive
side chain groups followed by cleavage with at least one
cleavage reagent specific for the biologically added
protecting group, or in the reverse order. The cleavage
reagent specific for the biologically added protecting
groups acts to cleave at the intraconnecting peptide and
to remove remaining intraconnecting peptide residues.
In either case, a protected single copy polypeptide
having an unprotected terminal amino acid α-carbon
reactive group is produced. The terminal α-carbon
reactive group is modified. The terminally modified
single copy polypeptide is deprotected to yield a
terminally modified recombinant single copy polypeptide.

The unprotected terminal α-carbon reactive
groups can be modified by reaction with a chemical
modifying agent. The modifying agent acts to add or
replace terminal amino acids with organic moieties.
Specific examples of types of modifications include:
C-terminal amidation; addition or replacement of
terminal amino acids with a D-amino acid, an L-amino
acid, an amino acid derivative, or peptides having a
combination thereof; formation of an N-acetyl group;
formation of an N-terminal amide or other N-terminal
addition moiety through reaction of an unprotected
α-amine group with a chemically produced oligopeptide or a
synthetic organic moiety having a reactive group which
will form a covalent bond with the N-terminal α-amine.
Modification can occur at one or both terminal α-carbon
reactive groups.

Once a protected recombinant single copy
polypeptide is modified, it is deprotected under
conditions allowing regeneration of the original side
chain reactive groups. The final product is a
terminally modified recombinantly produced single copy
polypeptide. Modifications can change the biological
activity or structure of the desired recombinant
polypeptide.

Detailed Description of the Invention

Recombinant DNA techniques have made possible
the selection, amplification, and manipulation of
expression of many naturally occurring proteins and
peptides. Naturally occurring proteins and peptides
recombinantly produced generally contain a multiplicity
of amino acids having side chains with different
functional or reactive groups including hydroxyl,
thiols, carboxyls, and ε-amine groups. Two other
important reactive groups are the N-terminal α-amine
reactive group and the C-terminal α-carboxyl reactive
group. It is often desirable to selectively modify a
recombinant polypeptide at the N-terminal α-amine and/or
C-terminal α-carboxyl groups. For example, the
C-terminal reactive carboxyl groups in some naturally
occurring proteins and peptides can be selectively
converted to an amide to provide for enhancement of
biological activity. Alternatively, a D-amino acid or
peptide could be added to or replace a terminal amino
acid.

These modifications can result in the formation
of analogs of the recombinantly produced polypeptide
that are longer acting and more potent than the
naturally occurring polypeptide. Generally, these types
of modifications to the recombinantly produced
polypeptide are not accomplished by alteration of the
DNA sequence for the recombinantly produced polypeptide
because the genetic code does not provide for amino
acid amides or for incorporation of a D-amino acid or an
amino acid derivative.

The present invention provides a method for the
selective modification of a recombinantly produced
polypeptide at a terminal α-carbon reactive group
selected from the group consisting of N-terminal
α-amine, C-terminal α-carboxyl and a combination
thereof. The first step in the method is to form the
recombinantly produced single or multicopy polypeptide
so that it is protected at one or both terminal α-carbon
reactive groups with a biologically added protecting
group.

The biologically added protecting group is
preferably an amino acid, peptide, and/or polypeptide
that contains at least one site that is cleavable
enzymatically or chemically, and preferably has a
sequence that is not present in the sequence of the
desired polypeptide. The biologically added protecting
group can be added to the recombinantly produced
polypeptide by joining the DNA sequence for the
biologically added protecting group to the 5' and/or 3'
terminus of the gene encoding the desired polypeptide.
Once formed, the recombinantly produced polypeptide,
biologically protected at the terminal α-carbon reactive
groups, is reacted with up to three chemical protecting
agents to protect the side chain groups and then is
cleaved with at least one cleavage reagent specific for
at least one biologically added protecting group.
Alternatively, the recombinant single copy polypeptide,
biologically protected at the terminal α-carbon reactive
group, is cleaved with a cleavage reagent specific for
at least one biologically added protecting group and
then reacted with up to three chemical protecting agents
that act to protect side chain reactive groups. In
either case, a polypeptide is produced having an
unprotected N- or C-terminal α-carbon reactive group and
protected side chain reactive groups. The unprotected
terminal amino acid α-carbon reactive group is modified
with a modifying agent to form a terminally modified
protected single copy polypeptide. The terminally
modified protected single copy polypeptide is then
deprotected to form an N- and/or C-terminally modified
single copy polypeptide.

The sequence and number of steps in the method
of the invention can be varied depending on the desired
modification, the amino acid sequence of the desired
polypeptide, and the starting material selected. The
starting materials of the invention can include a
recombinantly produced single copy polypeptide, or a
portion thereof, a multicopy polypeptide, a single copy
fusion protein, and a multicopy fusion protein.

WO 94/01451                              PCT/US93/06591

For example, the method of the invention
provides for the selective N-terminal α-amine and
C-terminal α-carboxyl modification of a recombinantly
produced single copy polypeptide. A recombinantly
produced single copy polypeptide is formed so that the
N-terminal α-amine is biologically protected by an amide
bond connection to an interconnecting peptide and
optionally a binding protein and the C-terminal
α-carboxyl is biologically protected by an amide bond
connection to an arginine residue. The recombinant
single copy polypeptide biologically protected at both
the N- and C-terminal α-carbon reactive groups is then
reacted with up to three chemical protecting agents so
that the reactive side chain groups present in the
recombinant single copy polypeptide are protected and
not available to react with the modifying agent. The
protected single copy polypeptide is then cleaved with a
cleavage reagent specific for the N-terminal biological
protecting group and the unprotected α-amine group is
reacted with a chemical modifying reagent. The modified
side chain protected single copy polypeptide is then
cleaved with a cleavage reagent specific for the
C-terminal biological protecting group. The unprotected
C-terminal α-carboxyl group is reacted with a second
modifying agent to form a side chain protected
N-terminal modified, C-terminal modified single copy
polypeptide. The protected N-terminal, C-terminal
modified single copy polypeptide is deprotected at the
side chain reactive groups to form a recombinant single
copy polypeptide modified at the N- and C-terminal ends
of the molecule. The reaction scheme showing sequential
N-terminal α-amine and C-terminal α-carboxyl
modification of a recombinant single copy polypeptide is
as follows:

Reaction Scheme I:
Selective Modification at the N- and C-Terminal
Amino Acid of a Recombinant Single Copy Polypeptide

(1) Forming the recombinant single copy polypeptide
    biologically protected at the N-terminal (BPI1)
    and C-terminal (Arg) ends

        BPI1 - scPP - Arg

(2) Chemical protecting agents

        BPI1 - scPP - Arg
                 |
               NHCOR

(3) First cleavage reagent specific for the
    N-terminal biological protecting group

        NH2 - scPP - Arg  +  (BPI1)
                |
              NHCOR

(4) First modifying agent

        M1NH - scPP - Arg
                 |
               NHCOR

(5) Second cleavage reagent specific for the
    C-terminal biological protecting group

        M1NH - scPP - COOH  +  (Arg)
                 |
               NHCOR

(6) Second modifying agent

        M1NH - scPP - COM2
                 |
               NHCOR

(7) Deprotecting

        M1NH - scPP - COM2

Key

BPI1 - scPP - Arg   = recombinant single copy fusion
                      protein (scPP) biologically protected
                      at the N-terminal α-amine by an amide
                      bond to an interconnecting peptide (I1)
                      and an optional binding protein (BP)
                      and protected at the C-terminal
                      α-carboxyl with an arginine (Arg)
                      residue

BPI1 - scPP - Arg   = recombinant single copy polypeptide
         |            protected at the side chain
       NHCOR          reactive groups (NHCOR)

NH2 - scPP - Arg    = side chain protected recombinant
        |             single copy polypeptide with
      NHCOR           unprotected N-terminal α-amine (NH2)

M1NH - scPP - Arg   = side chain protected recombinant
         |            single copy polypeptide with
       NHCOR          modified N-terminal α-amine (NHM1)

M1NH - scPP - COOH  = N-terminally modified side chain
         |            protected recombinant single copy
       NHCOR          polypeptide with unprotected
                      C-terminal α-carboxyl group

M1NH - scPP - COM2  = C-terminal (COM2) modified side
         |            chain protected single copy
       NHCOR          polypeptide

M1NH - scPP - COM2  = N- and C-terminally modified single
                      copy polypeptide




Another variation of the method of the
invention involves C-terminal modification of a single
copy polypeptide derived from a recombinantly produced
multicopy polypeptide. The multicopy polypeptide is
formed with multiple copies of the desired polypeptide
connected with intraconnecting peptides. The
intraconnecting peptide acts as a biological protecting
group for the C-terminal α-carboxyl reactive group of
the single copy polypeptides. The recombinantly
produced multicopy polypeptide is cleaved with a
cleavage reagent specific for the intraconnecting
peptide to form a first mixture of a single copy
polypeptide with an unprotected N-terminal α-amine and an
unprotected C-terminal α-carboxyl group and a single
copy polypeptide with an unprotected N-terminal α-amine
and an intraconnecting peptide at the C-terminal
α-carboxyl group. The first mixture is reacted with at
least one chemical protecting agent that forms
protecting groups at the reactive side chain groups and
the unprotected N-terminal α-amine reactive group. The
intraconnecting peptide at the C-terminal α-carboxyl
group is then removed by cleavage with a cleavage
reagent that digests the intraconnecting peptide
residues to form a side chain protected single copy
polypeptide having an unprotected C-terminal α-carboxyl
group. The unprotected C-terminal α-carboxyl group is
then modified with a modifying agent. The side chain
protected single copy polypeptide with modified
C-terminal α-carboxyl group is then deprotected to form
the C-terminal modified single copy polypeptide. The
reaction scheme depicting selective C-terminal
modification of a single copy polypeptide derived from a
recombinantly produced multicopy polypeptide is as
follows:

Reaction Scheme II:
Selective C-terminal Modification of a
Single Copy Polypeptide Derived from a
Recombinant Multicopy Polypeptide

(1) Forming the recombinant multicopy polypeptide
    with intraconnecting peptide (I2) as biologically
    added protective group

        NH2mc(PPI2)nCOOH

(2) Cleavage reagent specific for intraconnecting
    peptide

        NH2scPPCOOH                    First mixture
        NH2scPPI2

(3) Chemical protecting agents

        NHCORscPPCOOH                  Second mixture
        NHCORscPPI2

(4) Cleavage reagent specific for removal of the
    C-terminal biological protecting group

        NHCORscPPCOOH

(5) Modifying agent

        NHCORscPPCOM

(6) Deprotecting

        NH2scPPCOM

Key

NH2mc(PPI2)nCOOH  = multicopy polypeptide (mcPP)
                    intra-connected with an
                    intraconnecting peptide (I2)

NH2scPPCOOH       = single copy (sc) polypeptide
                    with unprotected N-terminal
                    α-amine and C-terminal COOH and
                    side chain groups

NH2scPPI2         = single copy polypeptide with
                    unprotected N-terminal α-amine
                    and C-terminal intraconnecting
                    peptide residues

NHCORscPPCOOH     = side chain protected (NHCOR)
                    single copy polypeptide

NHCORscPPI2       = side chain protected single copy
                    polypeptide having C-terminal
                    intraconnecting residues

NHCORscPPCOM      = side chain protected modified
                    single copy polypeptide

NH2scPPCOM        = terminally modified single copy
                    polypeptide
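
Steps (2) and (4) of Reaction Scheme II can be illustrated by treating sequences as strings. This is a sketch under stated assumptions, not the disclosed chemistry. The copy sequence `AGSW` and the intraconnecting peptide `FR` are invented placeholders, and the cleavage reagent is assumed to cut on the C-terminal side of each intraconnecting peptide. The functions `cleave_after_linker` and `trim_linker` are hypothetical names.

```python
from typing import List


def cleave_after_linker(multicopy: str, linker: str) -> List[str]:
    """Cut after every occurrence of the intraconnecting peptide.

    Upstream fragments keep the linker on their C-terminus; the last copy,
    if it carries no linker, ends in a free C-terminal residue.
    """
    fragments: List[str] = []
    start = 0
    while True:
        index = multicopy.find(linker, start)
        if index == -1:
            break
        end = index + len(linker)
        fragments.append(multicopy[start:end])
        start = end
    if start < len(multicopy):
        fragments.append(multicopy[start:])
    return fragments


def trim_linker(fragment: str, linker: str) -> str:
    """Remove remaining intraconnecting residues from the C-terminus, if present."""
    if fragment.endswith(linker):
        return fragment[: -len(linker)]
    return fragment


COPY = "AGSW"    # placeholder single copy polypeptide (assumption)
LINKER = "FR"    # placeholder intraconnecting peptide I2 (assumption)

multicopy = COPY + LINKER + COPY + LINKER + COPY
first_mixture = cleave_after_linker(multicopy, LINKER)
single_copies = [trim_linker(fragment, LINKER) for fragment in first_mixture]
print(first_mixture)
print(single_copies)
```

The sketch relies on the linker sequence not occurring inside the copy itself, which corresponds to the preference stated in the text that the recognition sequence be absent from the desired polypeptide.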

Other variations of the method of the invention
involving the number and sequence of the steps can be
utilized to achieve selective modification of the N-
and/or C-terminal α-carbon reactive group of a
recombinantly produced polypeptide. The combination of
steps that will be appropriate to result in selective N-
and/or C-terminal modification depends on the selection
of:

(a) the starting material - a multicopy
polypeptide or fusion protein can require additional
cleavage steps to form single copy polypeptides;

(b) whether the modification is at the N-
and/or C-terminal α-carbon reactive group, and
C-terminal modification requires extra steps;

(c) the amino acid sequence of the desired
polypeptide, especially the number of different side
chain reactive groups and whether a cleavage recognition
sequence is present in the sequence of the polypeptide;
and

(d) the type of modification - some types of
modification do not require protection of the side chain
groups.

A preferred variation of the multicopy method
of the invention is based upon the demifusion protein
concept. The interconnecting peptide contains two
unique cleavage sites, one of which is optionally the
same as appears in the intraconnecting peptide. The
binding protein is modified so that the cleavage sites
of the inter- and intraconnecting peptides do not appear
in the binding protein. After separation and
purification of the fusion protein, cleavage with a
first cleavage agent releases the demifusion protein
containing multiple copies of the desired polypeptide
and a short peptide sequence (i.e., the interconnecting
peptide) as the biological protecting group for the
N-terminal α-amine. The side chains of the demifusion
protein are protected. The interconnecting peptide
residue acts as a protecting group for the N-terminal
α-amine, the copies themselves act as protecting groups
for the internal N- and C-terminal groups of the
internal copies, and the C-terminal of the demifusion
protein is protected with another amino acid such as an
arginine. After protection, the cleavage agents are
added to cleave the N-terminal biological protecting
group if desired, to release the various copies of the
desired polypeptide and to create free N-terminal amines
or free C-terminal carboxylic acids as desired according
to the specific nature of the intraconnecting peptide
residue. Chemical modification at the N-terminal or
C-terminal or both, followed by removal of the protecting
groups and the residue on the termini, produces the
desired N- or C-terminal modified polypeptide.

A. Preparation of the Starting Materials: Forming the
   Recombinant Polypeptide Biologically Protected at
   the N- and/or C-Terminal α-Carbon Reactive Group

1. Selecting the Desired Peptide and the
   Modification

A polypeptide is a polymer of amino acids linked by amide bonds
having a terminal amino acid with a reactive α-amine group at one
end (N-terminal) and a terminal amino acid with a reactive
α-carboxyl group at the other end (C-terminal). A polypeptide
typically has at least one reactive or functional amine group
including the N-terminal α-amine group. In addition, the
polypeptide can have one or more reactive side chains including
ε-amino groups of lysine. Other amino acids have side chains with
reactive or functional groups like thiol, hydroxyl, phenolic
hydroxyl, imidazole and carboxylic acid groups. A recombinantly
produced polypeptide is a polypeptide that is produced by isolating
or synthesizing the gene for the polypeptide and introducing the
gene into a vector which allows for the amplification and
manipulation of expression of the gene in a host organism.

The starting material is selected, designed and then recombinantly
produced. The starting material is selected according to such
factors as:

(a) the characteristics of the desired polypeptide including the
desired modification, size and amino acid composition;

(b) whether the modification is to be made at the N- and/or
C-terminal amino acid α-carbon reactive group, requiring
biologically added protecting groups at one or both ends of the
molecule; and

(c) ease of purification: to enhance purification of the
recombinantly produced polypeptide, a single or multicopy fusion
protein can be formed.

Before the starting material of the invention is formed, the
desired polypeptide is selected because of its function, size, and
amino acid composition.

The function of the polypeptide selected for the method of the
invention can be altered by selective modification of the N- and/or
C-terminal amino acid. Modifications to the polypeptide can change
the structural characteristics and/or the biological activity of
the polypeptide. For example, C-terminal amidation of many small
peptides, like mastoparan or the human gastrin releasing peptide,
enhances the biological activity of these peptides. In another
example, N-terminal reaction with a synthetic organic moiety or a
synthetic organic/oligopeptide moiety significantly alters the
biological activity of these peptides. Moreover, addition of
peptides having D- or L-amino acids can provide for targeting of
the polypeptide to a specific cell type, changing the rate of
breakdown and clearance of the peptide, increasing the biological
potency and increasing the biological activities of the
polypeptide. Addition of D-amino acids or peptides or derivatives
of amino acids can also result in the formation of antagonists. The
choice of polypeptide and modification can be made based upon the
desired change of the structural or biological activity of the
peptide. The especially preferred modification is C-terminal
amidation of a peptide.

Several examples of modified polypeptides and the changes in
biological activity associated with this modification are described
in Kirk-Othmer Encyclopedia of Chemical Technology, 4th Edition,
Vol. 12, pp. 603-617 (1991), which is hereby incorporated by
reference.

The size of the selected polypeptide can range from a peptide of
about 4 amino acids to a polypeptide of about 4000 amino acids
(about 500,000 daltons). The larger polypeptides are typically
recombinantly produced as a single copy fusion protein or
polypeptide. Smaller peptides having 50 amino acids or less are
preferably produced as multicopy fusion proteins or polypeptides.
Especially preferred are small biologically active peptides having
50 amino acids or less.
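
The size range above can be cross-checked with a rough calculation.
The sketch below is illustrative only: it assumes an average residue
mass of about 110 daltons, a common rule of thumb that is not stated
in this specification, and the function name is hypothetical. Under
that assumption a 4000-residue chain comes out near 440,000 daltons,
the same order as the figure of about 500,000 daltons given above.

```python
# Rough chain-mass estimate from residue count.
# ASSUMPTION: 110 Da average residue mass (rule of thumb, not from the text).
AVG_RESIDUE_MASS_DA = 110.0
WATER_MASS_DA = 18.0  # one water remains per linear chain (free termini)


def approx_mass_da(n_residues: int) -> float:
    """Return an approximate molecular mass in daltons for a linear chain."""
    if n_residues < 1:
        raise ValueError("a polypeptide needs at least one residue")
    return n_residues * AVG_RESIDUE_MASS_DA + WATER_MASS_DA


for n in (4, 50, 4000):
    print(n, "residues ->", approx_mass_da(n), "Da (approximate)")
```

Actual masses depend on amino acid composition, so the estimate is
only useful for judging whether a construct is likely to be handled
as a single copy or a multicopy product.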

The amino acid composition of the desired polypeptide can have a
multiplicity of side chain functional reactive groups, but the
method is preferably directed to polypeptides having one or two
types of reactive side chain groups. For example, especially
preferred polypeptides are those having only ε-amine groups as
reactive side chain groups. Other especially preferred polypeptides
are those having ε-amino and hydroxyl or carboxyl side chain
groups. Many small biologically active peptides, like the magainin
polypeptides, have limited types of functional or reactive side
chain groups.

Specific examples of polypeptides having one or two types of
reactive side chain groups include the magainin polypeptides I, II
and III, as disclosed by Zasloff et al. in U.S. Patent
No. 4,810,777 (issued March 7, 1989); and wound healing peptides
like
Ala-Phe-Ser-Lys-Ala-Phe-Ser-Lys-Ala-Phe-Ser-Lys-Ala-Phe-Ser-Lys-Ala-Phe-Ser-Lys
(SEQ ID NO:1), as disclosed by Berkowitz et al. in U.S. Patent
No. 5,045,531 (issued September 3, 1991). These disclosures are
hereby incorporated by reference.
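
The preference for peptides with one or two types of reactive side
chain groups can be checked for a candidate sequence with a short
script. The following is a minimal sketch, not part of the disclosed
method: the residue-to-group mapping covers only the side chain groups
named in this section (ε-amino, thiol, hydroxyl, phenolic hydroxyl,
imidazole and carboxylic acid), and the names used are hypothetical.

```python
# Map one-letter residue codes to the reactive side chain groups named above.
# ASSUMPTION: simplified mapping; other groups (e.g. Arg guanidino) are ignored.
REACTIVE_SIDE_CHAINS = {
    "K": "epsilon-amino",
    "C": "thiol",
    "S": "hydroxyl",
    "T": "hydroxyl",
    "Y": "phenolic hydroxyl",
    "H": "imidazole",
    "D": "carboxyl",
    "E": "carboxyl",
}


def reactive_group_types(sequence: str) -> set:
    """Return the set of reactive side chain group types in a sequence."""
    return {
        REACTIVE_SIDE_CHAINS[res]
        for res in sequence.upper()
        if res in REACTIVE_SIDE_CHAINS
    }


# SEQ ID NO:1 is five repeats of Ala-Phe-Ser-Lys.
SEQ_ID_NO_1 = "AFSK" * 5
print(sorted(reactive_group_types(SEQ_ID_NO_1)))
```

For SEQ ID NO:1 only the lysine ε-amino and serine hydroxyl groups
are reported, which places it in the "one or two types" category
described above.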

Other examples of suitable polypeptides include the magainin
polypeptide 1, magainin polypeptide 2, magainin polypeptide 3,
wound healing peptides, myosin light chain kinase inhibitor,
substance P, mastoparan, mastoparan X, human amylin, rat amylin,
Icaria chemotactic peptide, carassin, human gastrin releasing
peptide, kemptamide, myosin kinase inhibiting peptide, melittin,
[Leu5]-enkephalamide, [Met5]-enkephalamide, metrophenamide, ScPg,
allatostatin 1, allatostatin 3, crustacean cardioactive peptide,
FMRF (molluscan cardioexcitatory neuropeptide), FMRF-like peptide
F1, neuromedin B, bombesin, leukopyrokinin, alytesin, corazonin and
littorin.

Once the desired polypeptide and modification are selected, the
starting material can be designed and recombinantly produced so
that the N- and/or C-terminal α-carbon reactive group has a
biologically added protecting group.

2. Selecting the Biologically Added Protecting Groups to Be
Added to the N- and/or C-Terminal α-Carbon Reactive Group of
the Polypeptide

Before the starting material is formed, the biologically added
protecting groups are selected. The biologically added protecting
groups can be a polypeptide, peptide and/or amino acid linked by an
amide bond connection to the N- and/or C-terminal α-carbon reactive
group. The type of bond formed is generally irreversible and the
sequence of the biological protecting group contains at least one
site that is cleavable enzymatically or chemically so that the
biological protecting group can be selectively removed. Preferably,
the sequence of the biologically added protecting group is not
present in the desired polypeptide. When both the N- and C-terminal
α-carbon reactive groups are protected with the biologically added
protecting groups, the biologically added protecting group at the
N-terminal α-carbon reactive group is preferably different from the
group at the C-terminal α-carbon reactive group to allow for
sequential cleavage of the N- and C-terminal biologically added
protecting groups.
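
The preference that the cleavage sequence not occur in the desired
polypeptide can be screened in a few lines. The sketch below is
illustrative: the recognition motifs are taken from Table 1 and
written in one-letter code, the function name is hypothetical, and a
plain substring test is a simplification of real enzyme specificity.

```python
# Recognition motifs from Table 1, in one-letter amino acid code.
CLEAVAGE_SITES = {
    "enterokinase": "DDDDK",    # (Asp)4Lys
    "factor Xa": "IEGR",        # IleGluGlyArg
    "thrombin": "RGPR",         # ArgGlyProArg
    "hydroxylamine": "NG",      # AsnGly
    "cyanogen bromide": "M",    # Met
}


def usable_sites(target: str) -> list:
    """Return cleavage agents whose motif is absent from the target peptide."""
    target = target.upper()
    return [name for name, motif in CLEAVAGE_SITES.items() if motif not in target]


wound_healing_peptide = "AFSK" * 5  # SEQ ID NO:1
print(usable_sites(wound_healing_peptide))
```

A motif that does occur in the target is simply excluded here; the
specification also allows such residues to be blocked or protected
instead, which this sketch does not model.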

The biologically added protecting group has at least one cleavage
site to provide for removal of all or part of the biological
protecting group. Specific examples of peptides and amino acids
that can serve as a cleavage site in biological protecting groups
and the cleavage enzymes or conditions are provided in Table 1.

TABLE 1

Enzymes for Cleavage          Biological Protecting Groups              DNA Seq.

Enterokinase                  (Asp)4Lys (SEQ ID NO:2)                   GACGACGACGATAAA (SEQ ID NO:12)
Factor Xa                     IleGluGlyArg (SEQ ID NO:3)                ATTGAAGGAAGA (SEQ ID NO:13)
Thrombin                      ArgGlyProArg (SEQ ID NO:4)                AGAGGACCAAGA (SEQ ID NO:14)
Ubiquitin Cleaving Enzyme     ArgGlyGly                                 AGAGGAGGA
Renin                         HisProPheHisLeuLeuValTyr (SEQ ID NO:5)    CATCCTTTTCATCTGCTGGTTTAT (SEQ ID NO:15)
Trypsin                       Lys or Arg                                AAA or CGT
Chymotrypsin                  Phe or Tyr or Trp                         TTT or TAT or TGG
Clostripain                   Arg                                       CGT
S. aureus V8                  Glu                                       GAA

Chemical Cleavage             Biological Protecting Groups              DNA Seq.

(at pH 3)                     AspGly or AspPro                          GATGGA
(Hydroxylamine)               AsnGly                                    AATCCA
(CNBr)                        Methionine                                ATG
BNPS-skatole                  Trp                                       TGG
2-Nitro-5-thiocyanobenzoate   Cys                                       TGT

The biological protecting group can contain more than one enzymatic
and/or chemical cleavage site, and preferably contains at least one
site cleaved by a chemical reagent and at least one site cleaved by
an enzyme. Alternatively, the biological protecting group can have
at least two different enzymatic sites of cleavage or at least two
different chemical cleavage sites. A specific example of a
biological protecting group having multiple cleavage sites is
exemplified by the following peptide (SEQ ID NO:6):

Phe Val Asp Asp Asp Asp Lys(A) Phe Val Asn(B)
Gly Pro Arg(C) Ala Met(D) Phe Val Asp Asp Asp
Asp Lys(A) Val Asn(B) Pro Arg(C) Ala Met(D) Ala

(A) = cleavage site for enterokinase
(B) = cleavage site for hydroxylamine
(C) = cleavage site for thrombin
(D) = cleavage site for cyanogen bromide

The biological protecting group with multiple cleavage sites can
also serve as an interconnecting or intraconnecting peptide. While
not in any way meant to limit the invention, the combination of
chemical and enzymatic cleavage sequences in the biological
protecting group provides for advantages in purification and
cleavage efficiency.

The biological protecting group can also be a combination of a
polypeptide and a peptide like, for example, in a recombinant
single copy fusion protein. A recombinant single copy fusion
protein has three tandemly coupled segments. The first segment is a
binding protein, the second segment is an interconnecting peptide,
and the third segment is the single copy polypeptide. The
interconnecting peptide connects the binding protein to the single
copy polypeptide at either the N- or C-terminal α-carbon reactive
group. The interconnecting peptide has at least one chemical or
enzymatic cleavage site and, preferably, has a sequence not found
in the single copy polypeptide. The interconnecting peptide and
optionally the binding protein act as the biologically added
protecting group at the N-terminal α-amine or C-terminal α-carboxyl
group and also provide for purification of the recombinantly
derived single copy polypeptide.

Another example is a recombinant multicopy fusion protein composed
of three tandemly coupled segments. The first segment is a binding
protein, the second segment is an interconnecting peptide, and the
third segment is a multicopy polypeptide. The interconnecting
peptide connects the binding protein to the N- or C-terminal
α-carbon reactive group of the multicopy polypeptide. The multicopy
polypeptide contains several copies of the single copy polypeptide
connected by an intraconnecting peptide. The inter- and
intraconnecting peptides both have at least one site that is
cleavable and preferably do not contain an amino acid sequence
present in the single copy polypeptide. The interconnecting peptide
and the intraconnecting peptide can act as biological protecting
groups of the N- and/or C-terminal α-carbon reactive groups of the
single or multicopy polypeptide. When both the C-terminal and
N-terminal α-carbon reactive groups are to be modified, preferably
the inter- and intraconnecting peptides have different cleavage
sites to provide for sequential cleavage.
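
The three-segment layout of a multicopy fusion protein can be
pictured as a simple data structure. The following is a sketch for
illustration only: the class and field names are hypothetical, the
binding protein is a placeholder string, and the choice of an
enterokinase site as interconnecting peptide and methionine as
intraconnecting residue is just one possible pairing of different
cleavage sites from Table 1.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MulticopyFusion:
    """Binding protein + interconnecting peptide + multicopy polypeptide."""

    binding_protein: str   # first segment (placeholder sequence here)
    interconnecting: str   # second segment, carries a cleavage site
    single_copy: str       # the desired polypeptide
    intraconnecting: str   # joins adjacent copies, different cleavage site
    copies: int

    def multicopy(self) -> str:
        """Third segment: copies of the peptide joined by the intraconnector."""
        return self.intraconnecting.join([self.single_copy] * self.copies)

    def sequence(self) -> str:
        """Full fusion protein, binding protein read first."""
        return self.binding_protein + self.interconnecting + self.multicopy()


fusion = MulticopyFusion(
    binding_protein="X" * 5,   # hypothetical stand-in for a real binding protein
    interconnecting="DDDDK",   # enterokinase site
    single_copy="AFSK",        # one Ala-Phe-Ser-Lys unit, for brevity
    intraconnecting="M",       # cyanogen bromide site
    copies=3,
)
print(fusion.sequence())
```

Because the inter- and intraconnecting segments use different
cleavage agents in this example, the binding protein could in
principle be removed before the copies are separated, which is the
sequential cleavage the paragraph above describes.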

Once the polypeptide and the desired modification are selected, the
protecting groups to be biologically added to the N- and/or
C-terminal α-carbon reactive groups are selected. The factors for
selecting the biologically added protecting groups to be combined
with the desired polypeptide include: (a) the amino acid sequence
of the single copy polypeptide; (b) whether the polypeptide is
going to be recombinantly produced as a single or multicopy
polypeptide; (c) whether a single or multiple cleavage site is
desired; (d) whether enzymatic or chemical cleavage is desired;
(e) whether a fusion protein is desired to provide for
purification; and (f) compatibility of the amino acid sequence of
the biological protecting group with the chemical protecting
agents.

3. Forming the Recombinant Single or Multicopy Polypeptide
Protected with One or More Biologically Added Protecting
Groups at the N- and/or C-Terminal α-Carbon Reactive Groups
by Standard Recombinant DNA Methodology

The single or multicopy polypeptide or fusion protein starting
material of the method of the invention is formed by standard
recombinant DNA methods. The gene sequence for the desired
polypeptide or a portion thereof can be cloned or, in the case of a
smaller peptide, synthesized by automated synthesis. The gene
sequence encoding the biologically added protecting group is
synthesized by automated oligonucleotide synthesis. The gene
sequence for the biologically added protecting group is combined
with the gene sequence for a single or multicopy polypeptide or a
portion thereof so that the single or multicopy polypeptide
produced has at least one cleavable biologically added protecting
group at the N- and/or C-terminal α-carbon reactive group.

The gene sequence for the biologically added protecting group
encodes a polypeptide, peptide, amino acid, or a combination
thereof. Preferably, the gene sequence encodes a peptide of less
than about 50 amino acids and provides for one site of cleavage by
a chemical reagent and at least one site of enzymatic cleavage.
Once the biological protecting group is selected, the DNA sequence
is formed by automated synthesis and combined with the gene
sequence for the single or multicopy polypeptide by standard
recombinant DNA methodologies. Specific examples of the DNA
sequences that correspond to amino acid cleavage sites are provided
in Table 1. The DNA sequences encoding chemical and enzymatic
cleavage sites can be combined into a gene sequence for a single
biological protecting group by automated oligonucleotide synthesis.
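
Combining the Table 1 DNA sequences into one protecting-group gene
can be sketched as string concatenation followed by translation with
the standard genetic code. This is an illustration under stated
assumptions, not the disclosed procedure: the order of the two sites
is arbitrary, and the function and variable names are hypothetical.

```python
from itertools import product

# Standard genetic code, built in TCAG order ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# DNA sequences for cleavage sites, copied from Table 1.
ENTEROKINASE_SITE = "GACGACGACGATAAA"  # (Asp)4Lys
THROMBIN_SITE = "AGAGGACCAAGA"         # ArgGlyProArg
CNBR_SITE = "ATG"                      # Met


def translate(dna: str) -> str:
    """Translate an in-frame DNA string into one-letter amino acid code."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# One enzymatic site followed by one chemical site (order is an assumption).
protecting_group_gene = ENTEROKINASE_SITE + CNBR_SITE
print(translate(protecting_group_gene))  # DDDDKM
```

Translating the assembled sequence back to protein is a quick way to
confirm that each site survived the assembly in the intended reading
frame.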

The single or multicopy polypeptide can also be formed as a
recombinant single or multicopy fusion protein. The fusion protein
has three tandemly coupled segments. The first segment is a binding
protein, which exhibits strong, reversible binding to a ligand for
the binding protein, preferably a reversible inhibitor for an
enzyme or enzyme-like binding protein. The second segment is an
interconnecting peptide, which is selectively cleavable by an
enzyme and/or chemical technique. The third segment is the single
or multicopy polypeptide. The binding protein with interconnecting
peptide provides for purification of the recombinantly produced
single or multicopy polypeptide and acts as a biological protecting
group for the N- or C-terminal α-carbon reactive group. Although
the binding protein and interconnecting peptide can both serve as
the biological protecting group, in a preferred embodiment, the
interconnecting peptide contains two selective cleavage sites so
that the binding protein can be removed and the interconnecting
peptide will remain. After purification in this preferred
embodiment, the binding protein can be cleaved to leave a peptide
fragment, i.e., the interconnecting peptide, which serves as the
biological protecting group. The resulting demifusion protein
eliminates the need to carry the binding protein peptide sequence
through the remaining steps and preserves the biological protecting
group benefits derived from the binding protein. Single or
multicopy fusion proteins are produced by standard recombinant DNA
methodology, as discussed in co-pending application Serial
No. 07/552,810, which is hereby incorporated by reference.
Formation of recombinantly produced single or multicopy fusion
proteins is described.

The binding protein segment of the fusion protein generally is an
antibody, an antibody L or H chain, an enzyme, a lectin, avidin or
any expression protein having a binding site for selective binding
to a ligand such as an antigen, a substrate, an inhibitor, a sugar
or biotin. Preferably, the binding protein is an enzyme-like
protein including but not limited to an enzyme or a truncated,
altered or modified functional version thereof (hereinafter the
modified functional version). The binding is preferably strong and
selective. Preferably for an enzyme the ligand is a reversible
inhibitor for the enzyme-like protein.

Especially preferred embodiments of the enzyme binding protein
include carbonic anhydrase derived from any source, especially
mammalian or human, and a modified functional version thereof which
will bind with the inhibitor, sulfanilamide or derivatives thereof.
An especially preferred embodiment of the modified carbonic
anhydrase enzyme is a functional version which (I) does not contain
methionine, (II) has all or some glutamates replaced by another
amino acid, preferably aspartate, (III) has all or some arginines
replaced by another amino acid, preferably lysine, (IV) has
asparagines next to glycine replaced by another amino acid,
preferably glutamine or glycine changed to alanine, (V) has
methionine replaced by another amino acid, preferably leucine,
(VI) has cysteine replaced by another amino acid, preferably
serine, and (VII) has methionine at position 240 replaced by
another amino acid, preferably leucine or serine and isoleucine.

Antibodies or individual chains, regions or fragments thereof, as
characterized above, and other proteins, which will strongly,
biospecifically and reversibly bind to a low molecular weight
ligand, can perform the same function in the same way to reach the
same result as the enzyme-like protein in the context of the
protein purification construct, and consequently are also preferred
within the invention as binding proteins. For antibodies or the
corresponding chains, regions or fragments, the ligand is a low
molecular weight antigen, preferably an aromatic moiety such as
dinitrophenol. Suitable binding proteins and their corresponding
ligands include those provided in Table 2.

TABLE 2

Binding Protein                     Ligand                                  Kd        Ref.

Xanthine Oxidase                    Allopurinol                             strong    1
[illegible]                         [illegible]
[illegible]                         [illegible]                                       2
Adenosine deaminase                 erythro-9-(2-hydroxy-3-nonyl)adenine              2
Dihydrofolate reductase             Methotrexate                            1.2E-9    4
Dihydrofolate reductase             Methotrexate                            2.3E-9    5
Dihydrofolate reductase             Aminopterin                             3.7E-9    5
Dihydrofolate reductase             Trimethoprim                                      5
Ribulose bisphosphate carboxylase   2-carboxyarabinitol 1,5-bisphosphate    1E-14     6
Pepsin                              Pepstatin                               10E-9
Calmodulin                          Melittin                                3E-9      7
Calmodulin                          Various peptides                        0.2E-9    7
Cholesterol esterase                Borinic acid                            0.1E-9    6
Carbonic anhydrase II               Sulfanilamide                           4.6E-7    3
Carbonic anhydrase II               Acetazolamide                           6E-10     3

E is times ten to the negative exponent indicated.

Other suitable binding proteins include β-galactosidase as
described by Hanada et al., J. Biol. Chem., 263:7181 (1988);
flagellin protein as described by Stahl et al., U.S. Patent
No. 4,801,526 (issued January 31, 1989); ubiquitin, Yoo et al.,
J. Biol. Chem., 264:17078 (1989); protein A, B. Nilsson et al.,
EMBO Journal, 4:1075 (1985); streptavidin, Meade et al.,
PCT/US 85/01901 (1986); and the flag peptide, K. Itakura et al.,
Science, 198:1056 (1977), which are hereby incorporated by
reference.
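
The dissociation constants in Table 2 indicate how tightly each
binding protein holds its ligand. As a rough illustration, the sketch
below computes the fraction of binding protein occupied at a given
free ligand concentration. It assumes simple one-site equilibrium
binding, fraction = [L] / (Kd + [L]), which is a textbook model and
not something this specification states; the function name is
hypothetical.

```python
# Kd values in mol/L, copied from Table 2.
KD_MOLAR = {
    ("Carbonic anhydrase II", "Sulfanilamide"): 4.6e-7,
    ("Carbonic anhydrase II", "Acetazolamide"): 6e-10,
    ("Dihydrofolate reductase", "Methotrexate"): 1.2e-9,
}


def fraction_bound(kd: float, free_ligand: float) -> float:
    """One-site equilibrium occupancy: [L] / (Kd + [L]). ASSUMED model."""
    if kd <= 0 or free_ligand < 0:
        raise ValueError("Kd must be positive and ligand non-negative")
    return free_ligand / (kd + free_ligand)


ligand_conc = 1e-6  # 1 micromolar free ligand, an arbitrary example value
for (protein, ligand), kd in KD_MOLAR.items():
    print(protein, ligand, round(fraction_bound(kd, ligand_conc), 3))
```

At a free ligand concentration equal to Kd the occupancy is one half,
so a smaller Kd means the binding protein is captured at lower ligand
concentrations.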

The choice of the interconnecting or intraconnecting peptide for
the single or multicopy fusion protein depends upon the choice of
cleavage enzyme and product peptide sequence. In general, the
interconnecting peptide sequence constitutes any peptide sequence
that uniquely reacts with a highly specific cleavage enzyme or is
cleaved by a highly specific chemical reagent, or a combination
thereof, like those shown in Table 1. The interconnecting or
intraconnecting peptide is connected to the N- and/or C-terminal
α-carbon reactive group and also serves as a biologically added
protecting group.

Generally, the interconnecting peptide and the intraconnecting
peptide fragments will have different amino acid sequences so that
they can be sequentially rather than simultaneously cleaved. The
amino acid sequences can also be chosen so that the cleavage
sequence does not duplicate any amino acid sequence of the product
peptide(s). Alternatively, the cleavage specific amino acids in the
peptide can be blocked or protected from the cleavage reaction as
provided in the method of the invention. These peptide and/or amino
acid connecting fragments can be chosen from the same group of
amino acid unit sequences, for example, those listed in Table 1.
The factors to consider in choosing these peptide connecting
fragments are similar to those for selecting other biological
protecting groups and include the following:

a) The amino acid sequence of the product peptides;

b) Whether the polypeptide is a single or multicopy polypeptide;

c) Whether a single or multiple cleavage site is desired;

d) Whether enzymatic or chemical cleavage is desired;

e) Whether the intra- and interconnecting peptides and the gene
fragments coding for them are positioned and altered to provide for
diversity in the gene sequence for the variable fused peptide. This
diversity allows efficient expression of multiple units of a small
peptide. It has been discovered that a continuously repetitive
genetic sequence will often be rearranged or deleted by the host
organism prior to recombination.

The recombinantly produced single or multicopy polypeptide with
N- and/or C-terminal biologically added protecting groups is
produced by standard recombinant DNA methods. An expression
cassette can be formed by combining the gene for the single or
multicopy polypeptide and the gene encoding the desired biological
protecting group with transcriptional and translational control
regions. For example, the recombinant gene encoding the fusion
protein incorporates three DNA segments coding for the binding
protein, the interconnecting peptide and the single or multicopy
polypeptide. The segments are arranged so that either the binding
protein gene fragment or the single or multicopy polypeptide
fragment can be read first. It is preferred to construct the fusion
protein gene so that the binding protein gene fragment is read
first. The gene segments can be synthetic or derived from natural
sources. The fusion protein gene is combined with transcriptional
and translational control regions to form an expression cassette.

An expression vector containing the expression cassette is capable
of providing for expression of the biologically protected single or
multicopy polypeptide in prokaryotic or eukaryotic cells. The
expression vector incorporates the single or multicopy polypeptide
gene and base vector segments such as the appropriate regulatory
DNA sequences for transcription, translation, phenotyping, temporal
or other control of expression, RNA binding and post-expression
manipulation of the expressed product. The expression vector
generally will include structural features such as a promoter, an
operator, a regulatory sequence and a transcription termination
signal. The expression vector can be synthesized from any base
vector that is compatible with the host cell or higher organism and
will provide the foregoing features. The regulatory sequences of
the expression vector will be specifically compatible or adapted in
some fashion to be compatible with prokaryotic or eukaryotic host
cells or higher organisms. Post-expression regulatory sequences,
which cause secretion of the fusion protein, can be included in the
eukaryotic expression vector. It is especially preferred that the
expression vector exhibit a stimulatory effect upon the host cell
or higher organism such that the fusion protein is overproduced
relative to the usual biosynthetic expression of the host.

In one preferred scheme for construction of the vector, the DNA
segment for the binding protein, for example the human gene for
carbonic anhydrase II (the binding protein gene), is inserted into
a base plasmid which is compatible with the host cell to be
transformed. The base plasmid contains the necessary regulatory
sequences for high level expression of genes placed downstream.

A synthetic DNA sequence coding for the interconnecting peptide is
then inserted near the 3' end of the binding protein gene. A
restriction enzyme site near the 3' end of the binding protein gene
should be present to enable insertion of this DNA sequence for the
interconnecting peptide. Also, at least one convenient restriction
enzyme site (intermediate vector restriction site) should be
designed into the synthetic DNA sequence for the interconnecting
peptide so that DNA segments coding for the variable fused
polypeptide can later be inserted in the correct reading frame. If
no such sites are already present, they can be introduced at this
point in the scheme by site-specific mutagenesis following standard
procedures described in Sambrook, J., Fritsch, E.F. and Maniatis,
T., Molecular Cloning, A Laboratory Manual, Cold Spring Harbor
Laboratory, Cold Spring Harbor, N.Y. (1989), the disclosure of
which is incorporated herein by reference.

The resulting vector construct is the intermediate base vector for
the in situ construction of the fusion protein gene integrated into
the larger vector. Any naturally occurring or synthetic DNA
sequence encoding a single or multicopy polypeptide can be inserted
into the intermediate vector restriction site to yield a fusion
protein gene integrated into the expression vector. Proper
insertion and reading frame alignment can be verified by known
techniques such as sequencing the junction region between the
binding protein gene and the DNA sequence for the variable fused
polypeptide according to methods described in Sambrook et al.
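
Reading frame alignment is verified experimentally by sequencing, as
stated above; a design can also be checked on paper beforehand. The
sketch below is a minimal illustration with hypothetical names: it
assumes the upstream coding sequence starts at the first base of a
codon, and it only tests codon boundaries and premature stop codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def in_frame(upstream: str, insert: str) -> bool:
    """Check that an insert joins upstream coding DNA in the correct frame.

    True when both pieces contain whole codons and the insert has no stop
    codon before its final codon.
    """
    upstream, insert = upstream.upper(), insert.upper()
    if len(upstream) % 3 != 0 or len(insert) % 3 != 0:
        return False
    codons = [insert[i:i + 3] for i in range(0, len(insert), 3)]
    return not any(codon in STOP_CODONS for codon in codons[:-1])


interconnecting_dna = "GACGACGACGATAAA"  # enterokinase site, Table 1
peptide_dna = "ATTGAAGGAAGA"             # example insert (Factor Xa site DNA)

print(in_frame(interconnecting_dna, peptide_dna))        # True
print(in_frame(interconnecting_dna, "G" + peptide_dna))  # False, frame shifted
```

A check of this kind catches a shifted frame at the junction before
any cloning is done, but it does not replace sequencing of the actual
construct.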

In a second alternative, after ligating together any two adjacent
DNA segments, the resulting intermediate gene can be transferred to
the base vector by the restriction and ligation methods described
above. The third DNA segment (i.e., the binding protein gene or
variable fused polypeptide gene) can be inserted into the base
vector carrying the intermediate gene pursuant to the Sambrook
techniques including construction of appropriate restriction sites,
if needed, and ligation procedures described above. All protocols
for restriction, insertion, ligation and the like follow standard
procedures such as those described by Sambrook, cited supra.

Preferred base vectors include any plasmid that is compatible with
the particular host, is stable in that host and allows for a
positive selection of the transformed host. Such vectors include,
for example, pTZ18/19U/R or pPL-lambda as well as those
characterized in P.H. Pouwels, B.E. Enger-Valk, and W.J. Brammar,
Cloning Vectors, Elsevier Science Pub. (1985), the disclosure of
which is incorporated herein by reference.

The final recombinant expression vector will carry an appropriate
promoter, a sequence coding for a ribosome binding site, phenotype
genes for selection, and regulatory regions for transcription,
translation and for post-translational intracellular manipulation
of the resulting biologically protected single or multicopy
polypeptide.

The expression vector is introduced into prokaryotic or eukaryotic
host cells by standard methods like calcium phosphate
precipitation, electroporation and microinjection. Isolation of
host cells transformed with the final recombinant expression vector
is accomplished by selecting for the phenotype or other
characteristic that is designed into the recombinant vector.
Generally, such selection characteristics include antibiotic
resistance or complementation of deficient functions in the host.
Preferred phenotype genes for the recombinant vector of the
invention include antibiotic resistant phenotypes, essential amino
acid phenotypes and other essential compound phenotypes.

Preferably, an inducible expression system is used so that the
selected, transformed host cell will be grown to an early- to
mid-logarithmic phase and treated with an induction compound to
cause the biologically protected single or multicopy polypeptide to
be produced. Typically, incubation will be continued for up to
several hours (the most appropriate incubation time for each single
or multicopy polypeptide is determined by sampling at differing
times during a test incubation), at which point the cells are
harvested and lysed. If the transformed host cell is designed to
secrete the biologically protected single or multicopy polypeptide,
the culture is grown until an appropriate and/or desired
concentration of the polypeptide is present in the culture medium.
If the host cell is one that will contain dissolved polypeptide in
its cytoplasm, the culture is grown until it reaches optimum
maturity. The mature culture is then lysed with an appropriate
agent to release the polypeptide. If the polypeptide or fusion
protein is deposited as insoluble granules in the host cell, the
mature cell culture is lysed and the released insoluble granules
are dissolved in chaotropic agents. This incubation, growth and
lysing process can be conducted in a batch or continuous manner.

The transformed cells are capable of expressing polypeptides
containing multiple copies of the polypeptides up to a molecular
weight of the largest protein naturally expressed by the cell. For
prokaryotic cells, this means that the size of the recombinant
protein expressed usually will be smaller than about 500,000
daltons. This is the size of certain enzymes naturally produced,
for example by E. coli and Bacillus subtilis, as disclosed by
B. Lewin, in Genes, 4th Edition, pages 606-607, Oxford Press, New
York, NY (1990), which is incorporated herein by reference.
Although eukaryotic cells utilize proteins of a larger size than
about 500,000 daltons, typically those larger proteins are
expressed as subunits and assembled by post-expression manipulation
in such cells. Examples of such larger proteins include hemoglobin
and antibodies. Although not meant in any way to limit the
invention, it is believed that the expression of very large
proteins (greater than 500,000 daltons) is limited by the
translational error frequency, which approaches 50% during
synthesis of a very large protein.

Other factors, as well, can influence the
control and extent of expression of the fusion protein
in cells transformed with the recombinant expression vector.
Optimal expression of a multicopy expression cassette or
vector can be achieved if the recombinant expression
vector is constructed with the following factors in mind.

The first factor is that the gene sequence for
the multicopy protein should have variations in the gene
sequence. This variation avoids a high degree of
repetition along the gene sequence and the protein
sequence. Such repetition endangers both the gene and
expressed fusion protein because the cell will recognize
the repetition sequence and excise or assimilate the
sequence or protein.

The second factor is that the binding protein
gene segment should have a size like that for an enzyme.
The size minimizes or prevents variation of
translational efficiency due to the needed variation of
the gene segment for the desired protein. The latter
gene segment variation is important for the reason
mentioned above. If the leader sequence is short, the
cell will recognize a variation in the tail sequences as
a signal to lower the expression efficiency for the
protein.

The third factor is that certain polypeptides
present in the multicopy alternative achieve a greater
increase in yield efficiency than others. This
efficiency depends on the ratio of the weight of the
binding protein to the weight of the desired protein.
Above a certain number of copies, the yield efficiency
does not appreciably increase for total molecular
weights greater than 250,000 daltons.
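
The size considerations above can be explored with a rough calculation. The following is a minimal sketch, not part of the disclosed method. It assumes an average residue mass of about 110 daltons (a common rule of thumb, not a value given in this text), and the function names and residue counts are hypothetical placeholders chosen only for illustration.

```python
# Rule-of-thumb average mass of one amino acid residue in a protein.
# This value is an assumption for illustration; it is not stated in the text.
AVG_RESIDUE_MASS_DA = 110.0


def fusion_mass_da(binding_residues, linker_residues, copy_residues,
                   n_copies, intra_residues=1):
    """Approximate mass (daltons) of a multicopy fusion protein.

    The construct is modeled as: binding protein + interconnecting
    peptide + n_copies of (polypeptide copy + intraconnecting residues).
    """
    multicopy = n_copies * (copy_residues + intra_residues)
    total_residues = binding_residues + linker_residues + multicopy
    return total_residues * AVG_RESIDUE_MASS_DA


def max_copies_below(limit_da, binding_residues, linker_residues,
                     copy_residues, intra_residues=1):
    """Largest copy number whose approximate total mass stays at or below limit_da."""
    n = 0
    while fusion_mass_da(binding_residues, linker_residues, copy_residues,
                         n + 1, intra_residues) <= limit_da:
        n += 1
    return n


if __name__ == "__main__":
    # Hypothetical residue counts: a 260-residue binding protein,
    # a 5-residue interconnecting peptide and 14-residue copies.
    print(fusion_mass_da(260, 5, 14, 3))
    print(max_copies_below(250_000, 260, 5, 14))
```

Such an estimate only indicates where a construct falls relative to the 250,000 and 500,000 dalton figures discussed above; actual yields would have to be determined experimentally.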

The fourth factor is that the expressed protein
should be soluble or form granules (inclusion bodies) in
the cytoplasm of the transformed cell. Purification and
post-expression manipulation of the fusion protein is
more readily accomplished when the fusion protein is
soluble or forms granules.

The fifth factor is that a strongly bound
inhibitor/enzyme couple is employed to separate and
purify the fusion protein. In order to achieve this
goal, the fusion protein should maintain essentially the
same binding constant between the enzyme and its
inhibitor as is exhibited by the free enzyme for the
inhibitor.

Although the formation of a recombinantly
produced single or multicopy fusion protein has been
described, the techniques described above can also be
used to add a different polypeptide, peptide and/or
amino acid as a biologically added protecting group to
the N- and/or C-terminal end of the single or multicopy
polypeptide. For example, in the method described above
if the binding protein is eliminated, the
interconnecting peptide is sufficient itself as a
biologically added protecting group. In another
example, the biologically added protecting group can be
as simple as a single amino acid added to the N- and/or
C-terminal amino acids of the single copy polypeptide.

In an alternative version, the single or
multicopy polypeptide can be recombinantly produced as a
truncated polypeptide having only a portion of the amino
acid sequence of the desired polypeptide. The
recombinantly produced truncated single or multicopy
polypeptide preferably lacks about 1 to about 10 amino
acids at the N- or C-terminal end of the molecule. The
gene for the truncated single or multicopy polypeptide
can be synthesized by automated synthesis or can be
obtained by restriction endonuclease cleavage of the entire
gene sequence so that the coding sequence for up to 10
amino acids is removed. The truncated gene can be
combined with the gene sequences for the binding protein
and interconnecting peptide or any other biologically
added protecting group as described herein. The amino
acids missing from the truncated single or multicopy
polypeptide are later replaced by a modification reaction.

The preferred starting material for the C-
and/or N-terminal selective modification method of the
present invention is a multicopy fusion protein having
several copies of polypeptide tandemly linked and
intraconnected via an amino acid and interconnected via
a peptide to the binding protein. An example of the
preferred multicopy fusion protein is comprised of a
human carbonic anhydrase II binding protein, optionally
modified by conversion of the methionine 240 to leucine
or isoleucine, interconnected by an enterokinase recognition
site or cyanogen bromide and hydroxylamine recognition
sites to the N-terminal α-amine of a multicopy
polypeptide having three tandemly linked copies of the
polypeptide mastoparan intraconnected with the amino
acid arginine, and having a C-terminal arginine. The
human carbonic anhydrase II binding protein option
enables removal of the binding protein after its
usefulness in separation and purification is finished.
This option eliminates the chemical processing of the
binding protein sequence that does not form part of the
final desired polypeptide. The benefits include but are
not limited to increased solubility of the demifusion
protein, increased facility to manipulate the demifusion
protein in subsequent processing, increased ability to
perform separation and purification of the demifusion
protein and early elimination of peptide sequences not
appearing in the final product.

An expression cassette for the human carbonic
anhydrase mastoparan fusion protein is formed as
follows. The especially preferred gene for the human
carbonic anhydrase II binding protein is obtained as
described in copending application Ser. No. 07/552,810.
When employing the hCAII gene, at least a portion
representing the functional fragment of the enzyme is
modified as follows: (a) the hCAII asparagine-glycine
peptide sequence is changed; the asparagine is changed
to glutamine or glycine is changed to alanine; and (b)
the sequence for the last three terminal amino acids is
deleted. Optionally, the hCAII is further modified to
convert methionine 240 to leucine, isoleucine or serine
(mini-modified hCAII).

The modified hCAII gene sequence can be
inserted into an expression vector which is compatible
with E. coli. Cleavage of the DNA sequence at a site
downstream from the regulatory portion of the vector
followed by insertion of the gene through blunt- or
sticky-end ligation forms the recombinant vector. The
insertion is downstream from the promoter sequences that
provide for expression in the host cells. The promoter
is preferably the T7 promoter. The T7 promoter is
recognized by a chromosomally encoded T7 RNA polymerase
induced by isopropyl-thiogalactoside.

A short DNA fragment coding for the interconnecting
peptide is inserted near the 3' or 5' end of
the intact or partial hCA gene (intraconnecting peptides
are discussed below). In a preferred version, the
peptide sequence recognized by enterokinase or the
peptide sequences recognized by cyanogen bromide (Met)
and hydroxylamine (Asn) is inserted at the 3' terminal
of the carbonic anhydrase. Preferably, the chemical
recognition sequence is spaced with Gly so that the
sequence reads: Met-Gly-Asn.

The gene fused onto the carbonic anhydrase
II-enterokinase recognition site construct encodes three
copies of the mastoparan sequence separated by arginine
residues (45 amino acids including C-terminal arginine).
The amino acid sequence for mastoparan is Ile-Asn-Leu-
Lys-Ala-Leu-Ala-Ala-Ala-Leu-Ala-Lys-Lys-Ile-Leu (SEQ ID
NO:7). This gene is prepared synthetically by the
method of multiple complementary oligonucleotide
synthesis as described by S. Beaucage et al., Tetrahedron
Letters, 22:1859 (1981), and is designed using optimal
codon usage for E. coli and contains unique and useful
restriction endonuclease sites. The synthetic gene is
inserted into the expression vector immediately
downstream from the enterokinase recognition site by
standard recombinant DNA methodology.
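
The layout of a multicopy polypeptide, in which each copy is followed by an intraconnecting residue and the last such residue forms the C-terminus, can be illustrated with one-letter amino acid codes. The sketch below is only an illustration under stated assumptions: the function names are hypothetical, the intraconnecting residue defaults to arginine ("R") as in the preferred embodiment, and the copy sequence is assumed not to contain that residue. The example uses a short made-up peptide rather than the mastoparan sequence.

```python
def build_multicopy(copy_seq, n_copies, intra="R"):
    """Return n_copies of copy_seq, each followed by the intraconnecting residue.

    The final intraconnecting residue becomes the C-terminal residue
    of the multicopy polypeptide.
    """
    if n_copies < 1:
        raise ValueError("n_copies must be at least 1")
    return "".join(copy_seq + intra for _ in range(n_copies))


def split_copies(multicopy_seq, intra="R"):
    """Recover the individual copies by splitting at the intraconnecting residue.

    Assumes the copy sequence itself does not contain the
    intraconnecting residue.
    """
    return [piece for piece in multicopy_seq.split(intra) if piece]


if __name__ == "__main__":
    # "GAK" is a hypothetical placeholder peptide, not a sequence from the text.
    construct = build_multicopy("GAK", 3)
    print(construct, len(construct))
    print(split_copies(construct))
```

With a copy of n residues, three copies and three intraconnecting residues give a construct of 3 x (n + 1) residues.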

E. coli cells are transformed with the
expression vector and transformed cells are selected.
The expression of the protein in the cells is induced
with isopropyl-thiogalactoside. Once sufficient protein
has accumulated, the cells are lysed and the fusion
protein purified.

4. Purification of Single or Multicopy Fusion 
Protein 

The recombinant single or multicopy polypeptide
produced as a fusion protein allows for easy
purification by affinity chromatography. The fusion
protein produced in the transformed cells can be soluble
in the cells or insoluble in inclusion bodies. Soluble
fusion protein is obtained by lysis of the transformed
cells to form a crude cell lysate. The crude cell
lysate can be further purified by methods including
ultrafiltration and ion exchange chromatography before
purification by affinity chromatography. Insoluble
fusion protein in inclusion bodies is also purified by
similar methods.

To perform affinity purification, the crude
mixture of materials is combined with an immobilized
ligand for the binding protein. Examples of the binding
protein, corresponding ligand and dissociation constants
are given in Table 2. For the preferred carbonic
anhydrase enzyme, or the preferred modified or mini-
modified carbonic anhydrase enzyme, the ligand is
sulfanilamide or a benzene sulfonamide derivative.
Immobilization of the ligand on a solid support can be
accomplished by the methods of W. Scouter, Methods
Enzymol., 34, 288-294 (1974); S. Marcus, Methods
Enzymol., 34, 377-385 (1974); A. Matsura et al., Methods
Enzymol., 34, 303-4 (1974); R. Barker, Methods Enzymol.,
34, 317-328 (1974); I. Hatsumoto, Methods Enzymol., 34,
324-341 (1974); J. Johansen, Carlsberg Res. Commun., 14,
73 (1976) and G. S. Bethell et al., J. Biol. Chem., 254,
2572-2574 (1979); the disclosures of which are
incorporated herein by reference. The fusion protein
binds to the immobilized ligand through the reversible
affinity of the binding protein for its ligand. The
remaining constituents and debris of the crude mixture
of materials can then be removed by washing or similar
techniques.

Two routes can be employed for further
purification of the fusion protein. According to the
first route, the single or multicopy fusion protein is
dissociated intact from the immobilized ligand by
washing with a strong competing ligand solution.
Examples include cyanides, pseudocyanides such as
thiocyanides, perchlorates, halide and similar strong
Lewis bases.

According to the second route, the immobilized
single or multicopy fusion protein is contacted directly
with cleavage reagent to release the single or multicopy
polypeptide. To isolate the single or multicopy
polypeptide in the second route, its mixture with
cleavage enzyme can be combined with a means for
molecular weight selection (e.g., partition
chromatography, dialysis, filtration based on molecular
size, high pressure liquid chromatography on a
"particle exclusion" base, or ion exchange
chromatography) such that the high molecular weight
cleavage enzyme is separated from the free variable
fused peptide. Alternatively, the mixture can be combined with an
immobilized affinity material for the cleavage enzyme.

The cleavage enzyme chosen will depend upon the
interconnecting peptide chosen. Examples of cleavage
enzymes and their cleavage sites are given in Table 1.

The purification methods described above yield
the starting materials for the method of the invention:
a single copy fusion protein, a multicopy fusion
protein, a single copy polypeptide, a multicopy
polypeptide, or a truncated single or multicopy
polypeptide. In a preferred embodiment, the single and
multicopy polypeptides are recombinantly produced from a
fusion protein. Both single copy and multicopy
polypeptides can be recombinantly produced with
additional residues at the N-terminal and/or C-terminal
ends of the molecule without the presence of a binding
protein or interconnecting peptide.

In a preferred version, the human carbonic
anhydrase, modified or mini-modified human carbonic
anhydrase multicopy mastoparan fusion protein is
isolated from cell lysates of transformed E. coli by
ultrafiltration followed by ion exchange chromatography.
The cell lysate material is then loaded onto an affinity
column containing sulfanilamide. The bound fusion
protein is then released from the affinity column by
washing with potassium thiocyanate. If carbonic
anhydrase, modified carbonic anhydrase or mini-modified
carbonic anhydrase is used, the purified fusion material
is then digested with enterokinase, and the multicopy
polypeptide is purified from the carbonic anhydrase
binding protein by ultrafiltration. The purified
multicopy polypeptide is composed of 3 copies of the
mastoparan intraconnected by arginine residues and has a
C-terminal arginine residue and an unprotected
N-terminal α-amine and other side chain groups. If the
carbonic anhydrase binding protein is a mini-modified
version, the purified fusion material is then digested
first with cyanogen bromide to cleave the carbonic
anhydrase residue from the remainder of the fusion
protein. The resultant demifusion protein is a purified
multicopy polypeptide composed of three copies of the
mastoparan that are intraconnected by arginine residues
and has a C-terminal arginine residue and an N-terminal
Asp4 Lys sequence or a Gly Asn sequence protecting the
N-terminal α-amine and has unprotected side chain groups.

B. Cleavage and Reaction of the Starting Materials With
Chemical Protecting Agents

In order to selectively modify the desired
recombinant polypeptide at the N- and/or C-terminal
α-carbon reactive groups, the other reactive side chain
groups are protected by reaction with up to three chemical
protecting agents. The biologically added protecting
group at the N- and/or C-terminal α-carbon is cleaved to
provide an unprotected reactive N- and/or C-terminal
α-carbon group available for modification.

The number and sequence of the cleaving and
reacting steps can vary depending on the starting
material and modification. In some cases, the reaction
scheme is conducted by reacting the starting material
with the chemical protecting agent(s) first and then
cleaving with a cleavage reagent specific for the N-
and/or C-terminal biological protecting group. For
example, if the starting material is to be modified at
the N-terminal amino acid or if the cleavage site of the
biologically added protecting group is present in the
desired polypeptide, then the starting material is
protected first and cleaved second. In other cases, the
starting material is cleaved first and then reacted with
up to three chemical protecting agents. For example,
for modification at the C-terminal amino acid the
starting material is cleaved and then reacted with the
chemical protecting agents.
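
Whether the cleavage site of the biologically added protecting group also occurs in the desired polypeptide can be checked by a simple sequence search. The following is a minimal sketch using one-letter amino acid codes. It assumes that the enterokinase recognition sequence is the Asp4 Lys sequence ("DDDDK") and that cyanogen bromide acts at methionine ("M"), as indicated elsewhere in this description; the dictionary and function names are hypothetical, and the peptides shown are made-up examples.

```python
# Recognition sequences in one-letter code, taken from this description:
# enterokinase recognizes Asp4 Lys and cyanogen bromide acts at Met.
# The dictionary itself is an illustrative construct, not part of the method.
CLEAVAGE_SITES = {
    "enterokinase": "DDDDK",
    "cyanogen bromide": "M",
}


def site_present(polypeptide, reagent):
    """Return True if the reagent's recognition sequence occurs in the polypeptide."""
    return CLEAVAGE_SITES[reagent] in polypeptide.upper()


def site_positions(polypeptide, reagent):
    """Return the 1-based start positions of every occurrence of the recognition sequence."""
    site = CLEAVAGE_SITES[reagent]
    seq = polypeptide.upper()
    return [i + 1 for i in range(len(seq) - len(site) + 1)
            if seq[i:i + len(site)] == site]


if __name__ == "__main__":
    # "GAMKL" and "GAKL" are hypothetical peptides used only for illustration.
    print(site_present("GAMKL", "cyanogen bromide"))
    print(site_present("GAKL", "enterokinase"))
    print(site_positions("GAMKLM", "cyanogen bromide"))
```

If the recognition sequence is found in the desired polypeptide, the passage above indicates that the starting material is protected first and cleaved second.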

Other variations in the number and sequence of
the cleaving and reacting steps are possible. A
reaction scheme can be selected according to the factors
provided in Table 3.



                               TABLE 3

                              Present in
                              Starting
Factor                        Material    Method
----------------------------  ----------  ---------------------------

1. Is the cleavage            Yes         React with chemical
   recognition sequence                   protecting agents and
   of the biological                      then cleave.
   protecting group
   present in the amino       No          Can go either way.
   acid sequences of the
   polypeptide?

2. Is the N-terminal          Yes         React with chemical
   amino acid to be                       protecting agents,
   modified?                              then cleave.

                              No          Cleave and then react
                                          with chemical
                                          protecting agents.

3. Is the starting            Yes         Two cleavage steps
   material a multicopy                   required - one at the
   fusion protein?                        inter- and one at the
                                          intraconnecting
                                          peptides.

4. Are both N- and            Yes         Extra steps of
   C-terminal amino                       cleavage and
   acids to be modified?                  modification required.

5. Does the modification      Yes         React with chemical
   reaction require                       protecting agent
   protection of reactive                 before modification.
   side chain groups?
                              No          Cleave and then
                                          modify. No reaction
                                          with chemical
                                          protecting agent
                                          required.



Once a particular starting material has been selected
and formed, the steps of the reaction scheme can be
selected according to the factors in Table 3.
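
The factors of Table 3 can be read as a small decision procedure. The sketch below is one possible encoding and is offered only as an illustration: the class and function names are hypothetical, and the way the factors are combined when several apply at once (for example, letting factor 5 take precedence over factors 1 and 2) is an assumption, since the table lists the factors individually.

```python
from dataclasses import dataclass


@dataclass
class Factors:
    """Yes/no answers to the five questions of Table 3."""
    site_in_polypeptide: bool          # factor 1
    modify_n_terminal: bool            # factor 2
    multicopy_fusion: bool             # factor 3
    modify_both_termini: bool          # factor 4
    needs_side_chain_protection: bool  # factor 5


def plan_scheme(f):
    """Return a list of method notes suggested by Table 3.

    The precedence used here is an assumption made for illustration.
    """
    notes = []
    if not f.needs_side_chain_protection:
        notes.append("cleave, then modify; no chemical protecting agent required")
    elif f.site_in_polypeptide or f.modify_n_terminal:
        notes.append("react with chemical protecting agents, then cleave")
    else:
        notes.append("cleave, then react with chemical protecting agents")
    if f.multicopy_fusion:
        notes.append("two cleavage steps: interconnecting and intraconnecting peptides")
    if f.modify_both_termini:
        notes.append("extra cleavage and modification steps required")
    return notes


if __name__ == "__main__":
    # N-terminal modification of a multicopy fusion protein whose cleavage
    # sites are absent from the single copy polypeptide.
    example = Factors(False, True, True, False, True)
    for note in plan_scheme(example):
        print(note)
```

For the case shown, the sketch suggests protecting first, then cleaving, with two cleavage steps, which agrees with the rows of Table 3 for factors 2 and 3.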

For example, for N-terminal modification of the
preferred multicopy fusion protein, the following
reaction scheme is selected. The preferred multicopy
fusion protein is three copies of the mastoparan
polypeptide intraconnected by arginine residues and
interconnected by the enterokinase recognition peptide
to carbonic anhydrase and having a C-terminal arginine
residue, or the three intraconnected mastoparan copies
are interconnected by a methionine glycine asparagine
residue to the mini-modified carbonic anhydrase having a
serine, isoleucine or leucine at 240. This demifusion
protein precursor also has a C-terminal arginine
residue. The sequence of neither the inter- nor the
intraconnecting peptide is found in the single copy
polypeptide, so the reaction scheme can go either way.
However, since N-terminal modification is desired, the
multicopy fusion protein is reacted with a chemical
protecting agent before it is cleaved. Since the
starting material is a multicopy fusion protein,
cleavage will involve reaction with a cleavage enzyme
specific for the interconnecting peptide and the
intraconnecting peptide, which in this case are
different. If the demifusion protein precursor is used,
the methionine of the interconnecting peptide is first
cleaved with cyanogen bromide to produce the N-terminal
biological protecting group. The multicopy demifusion
protein of this alternative is reacted with the chemical
protecting agent before the second cleavage to release
the several copies and the free N-terminal amine. Only
the N-terminal α-carbon is to be modified, so after the
cleavage step no additional cleavage or modification
reactions are necessary. The modification reaction is
an N-terminal acetylation reaction or an acylation with a
synthetic organic acylating group requiring protection
of the reactive side chain groups. The final product is
mastoparan having an N-terminal acetyl group or an
N-terminal synthetic organic acyl group. This reaction
scheme can be depicted as follows:

            multicopy fusion protein
                       |
                       |  (1) react with chemical
                       |      protecting agents
                       v
            side chain protected
            multicopy fusion protein
                       |
                       |  (2) cleave with cleavage
                       |      reagent specific for
                       |      intraconnecting peptide
                       v
            side chain protected
            single copy polypeptide
                       |
                       |  (3) cleave with cleavage
                       |      reagent specific for
                       |      interconnecting peptide
                       v
            side chain protected
            single copy polypeptides
            with unprotected N-terminal α-amine
                       |
                       |  (4) modification
                       v
            modified side chain protected
            single copy polypeptide
                       |
                       |  (5) deprotection
                       v
            N-terminally modified
            single copy polypeptide

1. Protection of Reactive Side Chain Groups
with Chemical Protecting Agents: Amine,
Hydroxyl, Carboxyl, Thiol Protection

The purified single or multicopy fusion protein
and the single or multicopy polypeptide also contain
amino acids with side chains having reactive groups like
ε-amine, hydroxyl, carboxyl and thiol groups. In
addition, one of the terminal amino acid α-carbon
reactive groups can also be unprotected. In order to
provide for the selective modification at the N-terminal
α-amine and/or C-terminal α-carboxyl groups, these other
reactive groups are protected so that they are
unavailable to react with the modifying agent.

The purified single or multicopy fusion protein
and the single or multicopy polypeptide are reacted with
up to three chemical protecting agents. The protecting
agent is selected by the capacity to form a protecting
group at a particular type of side chain reactive group,
as will be described herein. More than one protecting
agent can be used depending on the different types of
side chain reactive groups present in the single copy
polypeptide.

Preferably, the single copy polypeptide is
selected in part because it has a limited number of
different side chain reactive groups to minimize the
number of chemical protecting agents that are employed.
For example, preferably, the single copy polypeptide is
mastoparan which contains ε-amine and hydroxyl groups as
reactive side chain groups.
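
The types of reactive side chain groups present in a candidate polypeptide can be tallied from its sequence before protecting agents are chosen. The sketch below is an illustration only. The residue-to-group mapping (lysine to ε-amine; serine, threonine and tyrosine to hydroxyl; aspartic acid to β-carboxyl; glutamic acid to γ-carboxyl; cysteine to thiol) reflects general amino acid chemistry and is an assumption added here, since this description names only the group types. The names are hypothetical and the peptides are made-up examples.

```python
# One-letter residue codes mapped to the type of reactive side chain group.
# This mapping is an illustrative assumption based on general amino acid
# chemistry; the description itself lists only the group types.
REACTIVE_SIDE_CHAINS = {
    "K": "epsilon-amine",
    "S": "hydroxyl",
    "T": "hydroxyl",
    "Y": "hydroxyl",
    "D": "beta-carboxyl",
    "E": "gamma-carboxyl",
    "C": "thiol",
}


def reactive_group_types(sequence):
    """Return the sorted, distinct reactive side chain group types in a sequence."""
    found = {REACTIVE_SIDE_CHAINS[res] for res in sequence.upper()
             if res in REACTIVE_SIDE_CHAINS}
    return sorted(found)


if __name__ == "__main__":
    # "GKSAL" and "GAL" are hypothetical peptides used only for illustration.
    print(reactive_group_types("GKSAL"))
    print(reactive_group_types("GAL"))
```

A polypeptide with fewer distinct group types would, following the passage above, call for fewer chemical protecting agents.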

a. Amine Protection

A single or multicopy recombinant polypeptide
having at least one reactive amine group is reacted with
a chemical protecting agent to form an amine specific
protecting group. Preferably, the single or multicopy
polypeptide only contains ε-amino reactive side chain
groups. The second protecting agent acts on α-amine as
well as ε-amine side chain groups like those found in
lysine to form a stable but reversible bond. The bond
formed between the amine group and the protecting group
is sufficiently stable to withstand the chemical
modifying reaction conditions but also is easily
reversible to allow for deprotection and regeneration of
the original amine group.

Suitable chemical protecting agents that form
amine protecting groups can be selected by identifying
protecting groups that form a less stable bond with the
unprotected groups as compared with the stability of a
bond, like an amide, formed at the α-carboxyl of the
C-terminal amino acid or the N-terminal α-amine. The
chemical protecting agents form bonds at unprotected
amine or hydroxyl groups that are less stable than and
are different from the biological protecting group at
the N- and/or C-terminal that are typically a
polypeptide, peptide or an amino acid. Although not
meant to limit the invention, the protecting group can
be selected by identifying protecting group substituents
that will stabilize the formation of a carbonium ion on
the protecting group relative to the carbonium ion
formed at the C-terminal α-carboxyl group. Substituents
containing aromatic groups, oxygen, nitrogen,
unsaturated groups, aromatic acetyl groups, carbamates,
and cyclic anhydrides are groups that can act to
stabilize the carbonium ion on the "leaving protecting
group" and act to form a stable but reversible bond with
amine.

Suitable chemical protecting agents include
alkyl, alkoxy or aryl carbamating agents, alkyl or aryl
substituted acylating agents, and alkyl, alkoxy or aryl
substituted anhydrides and aryl or unsaturated cyclic
anhydrides. The order of preference of the protecting
group is as follows: aryl or unsaturated cyclic
anhydrides > carbamates > stabilized single acids.

Specific examples of amine protecting groups
include N-trichloroacetyl, N-trifluoroacetyl,
N-o-nitrophenylacetyl, N-o-nitrophenoxyacetyl,
N-acetoacetyl, N-3-phenylpropionyl,
N-3-(p-hydroxyphenyl)propionyl,
N-2-methyl-2-(o-nitrophenoxy)propionyl,
N-2-methyl-2-(o-phenylazophenoxy)propionyl,
N-4-chlorobutyryl, N-o-nitrocinnamoyl, N-picolinoyl,
N-(N'-acetylmethionyl), N-benzoyl, N-phthaloyl, and
N-dithiasuccinoyl.

Suitable examples of carbamate protecting
groups (including the amine) include methyl carbamate;
9-fluorenylmethyl carbamate; 2,2,2-trichloroethyl
carbamate; 2-trimethylsilylethyl carbamate;
1,1-dimethylpropynyl carbamate; 1-methyl-1-phenylethyl
carbamate; 1-methyl-1-(4-biphenylyl)ethyl carbamate;
1,1-dimethyl-2-haloethyl carbamate;
1,1-dimethyl-2-cyanoethyl carbamate; t-butyl carbamate;
cyclobutyl carbamate; 1-methylcyclobutyl carbamate;
1-adamantyl carbamate; vinyl carbamate; allyl carbamate;
cinnamyl carbamate; 8-quinolyl carbamate;
N-hydroxypiperidinyl carbamate;
4,5-diphenyl-3-oxazolin-2-one; benzyl carbamate;
p-nitrobenzyl carbamate; 3,4-dimethoxy-6-nitrobenzyl
carbamate; 2,4-dichlorobenzyl carbamate;
3-benzisoxazolylmethyl carbamate; 9-anthrylmethyl
carbamate; diphenylmethyl carbamate; isonicotinyl
carbamate; S-benzyl carbamate; and
N-(N'-phenylaminothiocarbonyl) derivative.

Other amine protecting groups include N-allyl,
N-phenacyl, N-3-acetoxypropyl, quaternary ammonium salts,
N-methoxymethyl, N-benzyloxymethyl, N-pivaloyloxymethyl,
N-tetrahydropyranyl, N-2,4-dinitrophenyl, N-benzyl,
N-o-nitrobenzyl, N-di(p-methoxyphenyl)methyl,
N-triphenylmethyl, N-(p-methoxyphenyl)diphenylmethyl,
N-diphenyl-4-pyridylmethyl, N-2-picolyl-N'-oxide,
N,N'-isopropylidene, N-benzylidene,
N-p-nitrobenzylidene, N-salicylidene,
N-(5,5-dimethyl-3-oxo-1-cyclohexenyl), N-nitro, N-oxide,
N-diphenylphosphinyl, N-dimethylthiophosphinyl,
N-benzenesulfenyl, N-o-nitrobenzenesulfenyl,
N-2,4,6-trimethylbenzenesulfenyl, N-toluenesulfonyl,
N-benzylsulfonyl, N-trifluoromethylsulfonyl, and
N-phenacylsulfonyl.

Especially preferred protecting agents of the
invention are maleic or citraconic anhydrides.

Typically, the amine groups can be protected by
formation of an amide bond by the reaction of the amine
groups with an anhydride as follows:

                        basic pH
     RNH2 + ROCOCOR  ------------->  RNHCOR + COOR

The reaction is conducted under conditions that favor
the formation of a reversible, stable amide bond,
preferably at the unprotected α-amine group of the
N-terminal amino acid and the ε-amine group of lysine.
Typically, arginine and histidine are much less
reactive.

Amine protection with carbamates proceeds by
the reaction of the amine groups as follows:

                                      NaOH
     RNH2 + (CH3)3COCO2CO2C(CH3)3  --------->  RNHCOOC(CH3)3
                                      25 C

The reaction conditions are also chosen so that the
unprotected N-terminal α-amine and lysine ε-amine groups
are protected. Typically, arginine and histidine are
relatively unreactive.

Polypeptide amine groups can also be protected
by addition of other types of groups including
N-alkylation or arylation. For example, reaction of
amines with diazo compounds in the presence of boron
trifluoride results in N-alkylation of the amine groups.

The selection of reaction conditions depends
upon the polypeptide amino acid composition, the type of
protecting groups added and the modifying agent chosen.
Specific conditions and reagents for adding protecting
groups to amine groups are described in Protective
Groups in Organic Chemistry, T. Green, editor, John
Wiley and Sons (1988), which is hereby incorporated by
reference.

b. Protection of the Amino Acids 
Having Hydroxyl Side Chains 

A preferred single or multicopy recombinant
polypeptide or fusion protein useful in the method of
the invention has one or two different types of reactive
side chain groups, including amino acids having hydroxyl
side chains. For example, a polypeptide can contain
α-amine, ε-amine and hydroxyl groups as reactive groups.
The method of the invention provides for protection of
amine and hydroxyl reactive side chain groups.

The hydroxyl groups of the single or multicopy
polypeptide are protected by reacting the polypeptide
with the chemical protecting agent as described for
amine protection. The chemical protecting agent forms a
stable reversible bond at the side chain hydroxyl group
in the same manner as described for amine protection.
The bond formed between the hydroxyl group and the
protecting group is sufficiently stable to withstand the
chemical modifying reaction conditions but is also
easily reversible to allow for deprotection and
regeneration of the original hydroxyl group.

Suitable second protecting agents are the same
as described for amine protection including alkyl,
alkoxy or aryl carbonating agents, alkyl or aryl
substituted acylating agents, alkyl, alkoxy or aryl
substituted anhydrides, aryl or unsaturated cyclic
anhydrides. The preferred protecting groups (including
the hydroxyl oxygen) that form a stable but easily
reversible bond are, in order of preference, aryl or
unsaturated cyclic anhydrides greater than carbamates,
greater than stabilized single acids.

Specific examples of the protecting groups are
provided in the amine protection section herein. The
highly preferred amine and hydroxyl protecting agent is
maleic anhydride.

Alternatively, hydroxyl group protection can be
achieved by reacting the starting material with a
protecting agent that forms an ether or ester bond at
the hydroxyl side chain groups. The ether or ester
bonds formed are stable to the modifying conditions but
are readily reversible to provide for regeneration of
the original hydroxyl group.

Specific examples of hydroxyl protecting groups
include the following ethers: methyl ether;
methoxymethyl ether (MOM); methylthiomethyl ether (MTM);
2-methoxyethoxymethyl ether (MEM);
bis(2-chloroethoxy)methyl ether; tetrahydropyranyl ether
(THP); tetrahydrothiopyranyl ether;
4-methoxytetrahydropyranyl ether;
4-methoxytetrahydrothiopyranyl ether; tetrahydrofuranyl
ether; tetrahydrothiofuranyl ether; 1-ethoxyethyl ether;
1-methyl-1-methoxyethyl ether; 1-(phenylselenyl)ethyl
ether; t-butyl ether; allyl ether; benzyl ether;
o-nitrobenzyl ether; triphenylmethyl ether;
α-naphthyldiphenylmethyl ether;
p-methoxyphenyldiphenylmethyl ether;
9-(9-phenyl-10-oxo)anthryl ether (Tritylone); trimethylsilyl ether
(TMS); isopropyldimethylsilyl ether;
t-butyldimethylsilyl ether (TBDMS); t-butyldiphenylsilyl
ether; tribenzylsilyl ether; and triisopropylsilyl
ether.

Specific examples of hydroxyl protecting groups
include the following esters: formate ester; acetate
ester; trichloroacetate ester; phenoxyacetate ester;
isobutyrate ester; pivaloate ester; adamantoate ester;
benzoate ester; 2,4,6-trimethylbenzoate (mesitoate)
ester; methyl carbonate; 2,2,2-trichloroethyl carbonate;
allyl carbonate; p-nitrophenyl carbonate; benzyl
carbonate; p-nitrobenzyl carbonate; S-benzyl
thiocarbonate; N-phenylcarbamate; nitrate ester; and
2,4-dinitrophenylsulfonate ester.

c. Protection of β- or γ-Carboxyl Groups

The single copy or multicopy polypeptide or
fusion protein can also have amino acids with β- or
γ-carboxyl side chains. The β- or γ-carboxyl side
chains can be protected with a chemical protecting agent
that reacts with carboxyl groups to form a stable but
reversible bond. The bond formed at the β- or
γ-carboxyl groups is sufficiently stable to withstand
chemical modifying conditions at the α-carboxyl group
but is also easily reversible to allow for deprotection
and regeneration of the original β- or γ-carboxyl group.
The protection conditions for protecting carboxyl groups
are also selected so that the amine and/or hydroxyl
protecting groups are not adversely affected.

Suitable protecting agents for protecting
carboxyl groups include o-nitrophenol esters, alkyl or
benzyl esters, 1-hydroxybenzotriazole esters,
alkyl chlorocarbonates, azides and hydrazides. The
especially preferred agent for the protection of
carboxyl groups is o-nitrophenol.

Specific examples of carboxyl protecting groups
include the following esters, amides and hydrazides:
methyl ester; methoxymethyl ester; methylthiomethyl
ester; tetrahydropyranyl ester; benzyloxymethyl ester;
phenacyl ester; N-phthalimidomethyl ester;
2,2,2-trichloroethyl ester; 2-haloethyl ester;
2-(p-toluenesulfonyl)ethyl ester; t-butyl ester;
cinnamyl ester; benzyl ester; triphenylmethyl ester;
bis(o-nitrophenyl)methyl ester; 9-anthrylmethyl ester;
2-(9,10-dioxo)anthrylmethyl ester; piperonyl ester;
trimethylsilyl ester; t-butyldimethylsilyl ester;
S-t-butyl ester; 2-alkyl-1,3-oxazolines;
N,N-dimethylamide; N-7-nitroindolylamide; hydrazides;
N-phenylhydrazide; and N,N'-diisopropylhydrazide.

The preferred α-carboxyl protecting agent can
act at the α- as well as the β- or γ-carboxyl groups to
form active esters. Selective modification, such as
amidation of the α-carboxyl groups, can be achieved by
one of two methods. Protection of the β- or γ-carboxyl
group can be a separate step, after the reaction of the
single or multicopy polypeptide with the first
protecting agent. Alternatively, protection of the β-
or γ-carboxyl group can occur during the modification
step.

In the first method, the protection of β- or
γ-carboxyl groups is accomplished in a separate step,
typically after the amine and hydroxyl groups have been
protected with the first chemical protecting agent. The
single or multicopy peptide has an additional C-terminal
amino acid such as arginine. The additional C-terminal
amino acid residue acts to protect the α-carboxyl group
of the penultimate amino acid. The protected single or
multicopy polypeptide with the C-terminal arginine
residue is reacted with the second agent to add
protecting groups to the β- or γ-carboxyl groups as well
as the α-carboxyl group of the arginine. The arginine
group is removed by digestion with carboxypeptidase B,
leaving a single or multicopy peptide with protected β-
or γ-carboxyl groups and an unprotected C-terminal
α-carboxyl group. The unprotected C-terminal α-carboxyl
group is then selectively amidated with the chemical
amidating agent.

In the second method, the β- or γ- or
α-carboxyls are protected in the modification reaction.
Selective α-carboxyl modification occurs by selecting
conditions that favor the more reactive α-carboxyl group
relative to the β- or γ-carboxyl groups. For example,
when the carboxyl groups are protected by forming active
esters, selective amidation occurs at the α-carboxyl
group by the addition of stoichiometric amounts of
ammonia at a pH of about 6 to 7. While not in any way meant
to limit the invention, the difference in the pKa values
between the α-ester and β- or γ-esters allows for the
selective amidation at the α-carboxyl.

d. Thiol Protection 

A single or multicopy recombinant polypeptide
having at least one reactive side chain thiol group is
reacted with a chemical protecting agent to form a
thiol-specific protecting group. The bond formed
between the thiol group and the protecting group is
sufficiently stable to withstand the chemical modifying
conditions, but is also easily reversible to allow for
deprotection and regeneration of the original thiol
group.

Specific examples of thiol protecting groups
include S-benzyl thioether, S-p-methoxybenzyl thioether,
S-p-nitrobenzyl thioether, S-4-picolyl thioether,
S-2-picolyl N-oxide thioether, S-9-anthrylmethyl
thioether, S-diphenylmethyl thioether,
S-di(p-methoxyphenyl)methyl thioether, S-triphenylmethyl
thioether, S-2,4-dinitrophenyl thioether, S-t-butyl
thioether, S-isobutoxymethyl hemithioacetal,
S-2-tetrahydropyranyl hemithioacetal, S-acetamidomethyl
aminothioacetal, S-cyanomethyl thioether,
S-2-nitro-1-phenylethyl thioether,
S-2,2-bis(carboethoxy)ethyl thioether, S-benzoyl
derivative, S-(N-ethylcarbamate), and S-ethyl disulfide.
The preferred thiol protecting agent is acetic anhydride
in potassium bicarbonate ((CH3CO)2O/KHCO3).

Typically, the thiol groups can be protected by
formation of a thioether bond as follows:

(1) CysSH + C6H5CH2Cl ----------> CysSCH2C6H5

or

                           CF3CO2H
(2) CysSH + (C6H5)2CHOH ------------> CysSCH(C6H5)2
                         25°C, 15 min

The reaction is conducted under conditions that favor
the formation of a reversible stable thioether bond.
Typically, methionine is not reactive under these
conditions.

Alternatively, thiol groups can be protected by
formation of a thioester bond as follows:

        (CH3CO)2O/KHCO3
CysSH -----------------> CysSCOCH3

The single copy or multicopy polypeptide can be
transferred into an organic solvent such as
dimethylformamide, if necessary. Other reactive side
chain groups are not adversely affected by these reaction
conditions.

The selection of reaction conditions depends
upon the single copy polypeptide amino acid composition,
the type of protecting groups added, and the modifying
agent chosen. Specific conditions and reagents for
adding protecting groups to thiol groups are described
in Protective Groups in Organic Chemistry, T. Greene,
editor, John Wiley and Sons (1988), which is hereby
incorporated by reference.

2. Cleavage of the Biological Protecting Group 

The biological protecting group is cleaved to generate
an unprotected N- or C-terminal α-carbon reactive group.
The cleavage step can take place either before or after
the reaction of the starting material with the chemical
protecting agents. In the preferred embodiment,
cleavage occurs after protection of the side chain
reactive groups with the protecting agents. The
cleavage step can require more than one cleavage reagent
to generate the unprotected N- or C-terminal α-carbon
reactive group. The unprotected C- or N-terminal
α-carbon reactive groups are available for modification.

The cleaving reagent is an enzyme or chemical
reagent that cleaves at the recognition sequence of the
inter- or intraconnecting peptide or removes
intraconnecting amino acids from the N- or C-terminal
end. Specific examples of the enzymes and chemical
cleavage reagents specific for inter- or intraconnecting
peptides are provided in Table 1. Enzymes that remove
amino acid residues from the C-terminal end are
carboxypeptidases and include carboxypeptidase A,
carboxypeptidase B, carboxypeptidase T, and
carboxypeptidase K. Enzymes that remove residues from
the N-terminal end are aminopeptidases, and include
leucine aminopeptidase, aminopeptidase M, Aeromonas
aminopeptidase, X-prolyl dipeptidyl aminopeptidase, as
well as enzymes listed in Table 1.

A single cleavage reagent can be sufficient but
multiple cleavage reagents may be necessary to provide
an unprotected N- or C-terminal α-carbon reactive group.
The inter- or intraconnecting peptide can contain
multiple cleavage sites and preferably has at least one
enzymatic cleavage site and one chemical cleavage site.
In site-specific cleavage, amino acid residues of the
inter- or intraconnecting peptide can remain at the N-
or C-terminal ends and require removal by carboxy- or
aminopeptidase enzymatic digestion.

Multiple cleavage reagents and steps can also
be required depending on the selection of the starting
material. For example, if the starting material is a
multicopy fusion protein, cleavage with a cleavage
reagent specific for the inter- and intraconnecting
peptide generates a mixture of single copy polypeptides.
Preferably the interconnecting and intraconnecting
peptides have a sequence that is recognized by the same
cleavage reagent so single copy polypeptides can be
generated in a single step using a single cleavage
reagent. If the interconnecting and intraconnecting
peptides are different, two different cleavage enzymes
can be employed together or sequentially to generate the
single copy polypeptides. The mixture of single copy
polypeptides contains single copy polypeptides having
intraconnecting peptide at the C-terminal end. If
modification is to be made at the C-terminal α-carboxyl
group, the mixture is also cleaved with a
carboxypeptidase to remove the intraconnecting peptide
at the C-terminal end.
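
The choice of cleavage steps described above can be sketched in a few lines of Python. This is only an illustration of the reasoning, not part of the disclosed method: the function name `cleavage_steps` and the reagent labels passed to it are hypothetical.

```python
from typing import List


def cleavage_steps(inter_reagent: str, intra_reagent: str,
                   modify_c_terminus: bool) -> List[str]:
    """Return an ordered list of cleavage reagents (illustrative only)."""
    steps = [inter_reagent]
    # A second reagent is needed only when the interconnecting and
    # intraconnecting peptides are recognized by different reagents.
    if intra_reagent != inter_reagent:
        steps.append(intra_reagent)
    # Intraconnecting residues left at the C-terminal end are removed
    # before the C-terminal alpha-carboxyl group is modified.
    if modify_c_terminus:
        steps.append("carboxypeptidase")
    return steps


if __name__ == "__main__":
    print(cleavage_steps("trypsin", "trypsin", True))
    print(cleavage_steps("enterokinase", "trypsin", False))
```

When both connecting peptides share one recognition sequence, the list collapses to a single reagent, which mirrors the stated preference for generating single copy polypeptides in one step.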

Multiple cleavage steps can be required if both
the N- and C-terminal α-carbon reactive groups are to be
modified. For example, a recombinant single copy
polypeptide protected at both the N- and C-terminal ends
with biological protecting groups is sequentially
cleaved. Typically, the N-terminal biological
protecting group is removed and the N-terminal α-amine
group is then modified. The C-terminal protecting group
is then removed and the C-terminal α-carboxyl group is
then modified. In this case, the N- and C-terminal
biological protecting groups contain different
recognition sequences for cleavage reagents to allow for
sequential cleavage.

In a preferred version, the recombinant
multicopy fusion protein, having three copies of the
mastoparan polypeptide intraconnected by arginine
residues and interconnected by an enterokinase recognition
peptide sequence to carbonic anhydrase, or interconnected
by a methionine glycine asparagine peptide sequence to a
mini-modified carbonic anhydrase, and with a C-terminal
arginine, is cleaved to form single copy polypeptides by
sequential cleavage. The multicopy fusion protein is
cleaved with enterokinase, or in the case of the
mini-modified carbonic anhydrase is cleaved with CNBr to
remove the carbonic anhydrase sequence, to produce
respectively a multicopy polypeptide or a demifusion
multicopy polypeptide. The multicopy polypeptide is
then reacted with maleic anhydride, which adds a
protecting group to unprotected ε-amino groups of lysine
present in the mastoparan polypeptide. The demifusion
protein is then cleaved with hydroxylamine to remove
the N-terminal biological protecting group
(interconnecting peptide residue). The multicopy
polypeptide or hydroxylamine treated demifusion
multicopy polypeptide is then cleaved with trypsin to
produce a mixture of single copy polypeptides. The
protected lysine groups are not recognized and cleaved
by trypsin. The mixture of single copy polypeptides
contains single copy polypeptides with
unprotected N-terminal α-amine groups and
intraconnecting peptide at the C-terminal α-carboxyl
group. If the C-terminal α-carboxyl group is to be
modified, the unprotected N-terminal α-amine is
protected by reaction with a chemical protecting agent,
like maleic anhydride, and the C-terminal intraconnecting
peptide residues are removed by cleavage with a
carboxypeptidase. The side chain protected single copy
polypeptide with unprotected C-terminal α-carboxyl
produced can then be modified.
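
The effect of lysine protection on the trypsin step can be illustrated with a short Python sketch. Several things here are assumptions and not taken from this section: the one-letter mastoparan sequence INLKALAALAKKIL is the commonly reported sequence, the function names are hypothetical, and trypsin is modeled simply as cutting after Arg (and after Lys only when lysine is unprotected), ignoring finer specificity rules.

```python
from typing import List

# Assumed mastoparan sequence (14 residues); not stated in this section.
MASTOPARAN = "INLKALAALAKKIL"


def build_multicopy(copy: str, n: int, linker: str = "R") -> str:
    """Join n copies, each followed by an intraconnecting arginine."""
    return "".join(copy + linker for _ in range(n))


def trypsin_cleave(seq: str, lys_protected: bool) -> List[str]:
    """Cut after Arg, and after Lys only if lysines are unprotected."""
    sites = {"R"} if lys_protected else {"R", "K"}
    fragments, current = [], ""
    for residue in seq:
        current += residue
        if residue in sites:
            fragments.append(current)
            current = ""
    if current:
        fragments.append(current)
    return fragments


def carboxypeptidase_b(fragment: str) -> str:
    """Remove one C-terminal Arg, the intraconnecting residue."""
    return fragment[:-1] if fragment.endswith("R") else fragment


if __name__ == "__main__":
    multicopy = build_multicopy(MASTOPARAN, 3)
    print(len(multicopy))
    print(trypsin_cleave(multicopy, lys_protected=True))
    print(trypsin_cleave(multicopy, lys_protected=False))
```

With lysines protected, the model yields three intact copies that each carry a C-terminal arginine; without protection, every copy is also cut at its internal lysines, which is why the maleic anhydride step precedes the trypsin digestion.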

C. Selective Modification of N-Terminal α-Amine
and/or C-Terminal α-Carboxyl Groups

Recombinant polypeptides or peptides can be
modified selectively at the N-terminal or C-terminal
α-carbon reactive groups by the addition of a variety of
organic moieties. While not in any way meant to limit
the invention, modification reactions at the C-terminal
α-carboxyl or N-terminal α-amine groups are those that
proceed by nucleophilic substitution. Nucleophilic
substitutions are described in J. March, Advanced Organic
Chemistry, Chapter 10, 3rd ed., John Wiley and Sons,
NY (1984), which is hereby incorporated
by reference. The bonds formed at the N- and/or
C-terminal α-carbon reactive groups are stable and
generally irreversible under the deprotection conditions
employed to regenerate the side chain groups.
Polypeptides can be sequentially modified at the N- and
C-terminal α-carbon reactive groups by the same or
different modifications.

Specific examples include addition to or
replacement of terminal amino acids with a D-amino acid,
D-amino acid containing peptide, L-amino acid peptide,
or an amino acid analogue or derivative at one or both
of the terminal ends of the recombinant polypeptide by
formation of an amide bond. Another modification is the
conversion of an N-terminal glutamic acid or glutamine
to a pyroglutamyl residue. The preferred modification
of the method of the invention is the selective
C-terminal α-carboxyl amidation reaction or the selective
N-terminal α-amine reaction with a synthetic organic
group.

The modification made to the N-terminal and/or
C-terminal α-carbon reactive group can be selected
according to several factors. Factors to be considered
in selecting the terminal modifications are the amino
acid sequence of the single copy polypeptide, the size
of the single copy polypeptide, the change in the
biological activity of the single copy polypeptide, how
the modified single copy polypeptide is going to be
used, and prevention of racemization at the modified N-
and/or C-terminal α-carbon.

The amino acid sequence of the single copy
polypeptide preferably has about one or two different
reactive side chain groups. For example, a polypeptide
having ε-amine and hydroxyl side chain groups can be
protected in a single step using an amine protecting
agent as described previously. The modifications,
conditions and agent are chosen so that the ε-amine and
hydroxyl groups are not deprotected or otherwise
adversely affected during the modification reaction. In
contrast, a single copy polypeptide with ε-amine,
hydroxyl, β- or γ-carboxyl, and thiol groups can require
reaction with three different protecting agents to
provide for side chain protection of the ε-amine and
hydroxyl groups, β- or γ-carboxyl groups, and thiol
groups. The modification conditions and reactions are
selected so that the side chain protecting groups remain
intact and are not adversely affected.
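
The link between amino acid composition and the number of protecting agents can be sketched as follows. This is a minimal illustration under stated assumptions: the residue-to-group mapping (Lys for ε-amine; Ser, Thr and Tyr for hydroxyl; Asp and Glu for β- and γ-carboxyl; Cys for thiol) is standard chemistry rather than a list given in this section, other reactive side chains are ignored, and the names used are hypothetical.

```python
from typing import List

# Assumed mapping of one-letter residues to side chain classes.
SIDE_CHAIN_CLASSES = [
    ("amine/hydroxyl", set("KSTY")),  # one agent covers both groups
    ("carboxyl", set("DE")),          # beta- and gamma-carboxyl
    ("thiol", set("C")),
]


def protecting_agents_needed(sequence: str) -> List[str]:
    """List the protecting agent classes a sequence would call for."""
    residues = set(sequence.upper())
    return [name for name, members in SIDE_CHAIN_CLASSES
            if residues & members]


if __name__ == "__main__":
    # Assumed mastoparan sequence; lysine is its only such side chain.
    print(protecting_agents_needed("INLKALAALAKKIL"))
    print(protecting_agents_needed("KSDEC"))
```

A sequence that returns one or two classes matches the stated preference; a sequence that returns all three corresponds to the case requiring three different protecting agents.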

Conditions that lead to deprotection of the
amine, carboxyl and thiol protecting groups are
described in Protecting Groups in Organic Synthesis, T.
Greene, editor, John Wiley and Sons (1988). These
conditions should be avoided during the modification
process and, thus, the modification reaction conditions
should be chosen to avoid or prevent deprotection of
these side chain reactive groups.

The size of the single copy polypeptide is
preferably about 10-50 amino acids. While the selective
modification methods of the invention can be conducted
on larger polypeptides, reaction conditions for adding
protecting groups and modifying groups are selected so
as not to cause irreversible denaturation of the
polypeptide. Polypeptides with greater than 50 amino
acids are protected and modified in aqueous solutions of
a pH of about 2-10 and a temperature of less than about
50°C.
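
The stated limits for larger polypeptides can be written as a simple check. This is a sketch only: the function name is hypothetical, the "about" in the text is treated as a hard boundary for simplicity, and no limits are applied to polypeptides of 50 residues or fewer because none are stated for them here.

```python
def within_stated_conditions(length: int, ph: float, temp_c: float) -> bool:
    """Check pH and temperature against the limits for >50 residues."""
    if length <= 50:
        # No explicit pH or temperature limits are stated for this case.
        return True
    return 2.0 <= ph <= 10.0 and temp_c < 50.0


if __name__ == "__main__":
    print(within_stated_conditions(120, ph=7.0, temp_c=25.0))
    print(within_stated_conditions(120, ph=11.0, temp_c=25.0))
```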

Modifications to the polypeptide can change the
biological activity of the polypeptide. For example,
C-terminal amidation of many small peptides, like
mastoparan or the human gastrin releasing peptide,
enhances the biological activity of these peptides.
Moreover, addition of peptide sequences of D- or L-amino
acids can provide for targeting of the polypeptide to a
specific cell type, decreasing the rate of breakdown and
clearance of the peptide, increasing the biological
potency and adding other biological activities to the
polypeptide. Addition of D-amino acids or peptides or
derivatives of amino acids can also result in the
formation of antagonists. The choice of modification
can be made upon the desired change of the biological
activity of the peptide.

The fourth factor to consider in selecting
modifying reactions and conditions is how the modified
product is going to be used. If the polypeptide is to
be used in vivo, the modification selected can be one
that enhances, targets, expands, or inhibits the
biological activity of the polypeptide. If the
polypeptide is being modified for use in a diagnostic
test, the impact of the modification on the structure of
the polypeptide rather than the biological activity is
examined. For use in diagnostic tests, the modified
polypeptide is still specifically recognized by
antibodies or by specific binding to a target molecule.

The fifth factor to consider in choosing the
modification reaction and conditions is to prevent
formation of a racemic mixture of the modified single
copy polypeptides. Some types of modification reactions
are known to result in racemic mixtures and, thus, are
not suitable for the method of the present invention.

Specific examples of modification reactions and 
conditions follow. 

1. Selective Amidation of the
Carboxy-Terminal Amino Acid

The protected single copy polypeptide having an
unprotected C-terminal α-carboxyl group is reacted with
a chemical amidating agent by standard methods, as
described in Bodanszky, Peptide Chemistry: A Practical
Textbook, Springer-Verlag, publisher (1988), which is
hereby incorporated by reference. Suitable chemical
amidating agents include 1-ethyl-3-(3-dimethylaminopropyl)
ethyl carbodiimide hydrochloride and
ammonia, water soluble carbodiimides and ammonia,
dicyclohexyl carbodiimide and ammonia, acid chlorides
and ammonia, azides and ammonia, mixed anhydrides and
ammonia, methanolic HCl and ammonia, o-nitrophenyl
esters and ammonia, and esters of 1-hydroxybenzotriazole
and ammonia.

Typically, the protected polypeptide is reacted
with a chemical amidating agent like carbodiimide and
o-nitrophenol to form activated esters as follows:

(1) RCOOH + C6H5CNCC6H5 + C6H4OHNO2 ---> RCOOC6H4NO2

(2) RCOOC6H4NO2 + NH3 ---> RCONH2 + C6H4OHNO2

The amidation occurs upon addition of ammonia or a
source of ammonia to the active ester. Other carboxyl
or acidic side chains present in the polypeptide, if not
already also protected, form active esters. In order to
provide for a selective α-carboxyl C-terminal amidation,
reaction conditions are chosen to favor amidation at the
more reactive α-carboxyl in contrast to the β- or
γ-carboxyl side chains. For example, addition of a
stoichiometric amount of ammonia at a pH of about 6
favors the formation of the amide at the α-carboxyl
group. Carboxyl activating and amidation conditions are
also such that deprotection of the amine and hydroxyl groups
does not occur.

An alternative method of amidation is to react
the unprotected C-terminal α-carboxyl group with the
photonucleophile o-nitrophenol-glycineamide. The
photonucleophile acts to convert the carboxyl group to
the amide.

The selection of reaction conditions depends
upon the amino acid composition of the polypeptide, the
type of protecting group utilized, and the chemical
amidating agent chosen. For example, if the
polypeptide does not contain β- or γ-carboxyl groups,
the utilization of conditions favoring α-carboxyl
amidation is not necessary.

The preferred side chain protected mastoparan
polypeptide is reacted with 1-ethyl-3-(3-dimethylaminopropyl)
ethyl carbodiimide hydrochloride in the
presence of excess NH4OH to form a C-terminal amidated
protected mastoparan polypeptide. Since mastoparan does
not contain aspartic or glutamic acid, reaction
conditions are not adjusted to favor amidation of the
α-carboxyl group. The C-terminal amidated protected
polypeptide is then deprotected and purified.
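
Whether conditions favoring the α-carboxyl group are needed can be decided from the sequence alone, as the following sketch suggests. The function name is hypothetical, and the mastoparan sequence used in the example is the commonly reported one, assumed here because it is not written out in this section.

```python
def needs_selective_amidation(sequence: str) -> bool:
    """True if Asp (D) or Glu (E) side chain carboxyls are present."""
    return any(residue in "DE" for residue in sequence.upper())


if __name__ == "__main__":
    # Assumed mastoparan sequence; it has no Asp or Glu residues.
    print(needs_selective_amidation("INLKALAALAKKIL"))
    # A made-up peptide containing Glu, for contrast.
    print(needs_selective_amidation("GAEL"))
```

A False result corresponds to the mastoparan case, where excess ammonium hydroxide can be used without adjusting conditions; a True result points to stoichiometric ammonia at a pH of about 6, or to separate protection of the side chain carboxyls.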

2. Modification of N-terminal and C-terminal Amino
Acid With D-amino Acids or Peptides, L-Amino
Acid Peptides, and Amino Acid Derivatives

A D-amino acid, L-amino acid, an amino acid
derivative, or peptides containing a combination thereof
can be added to the N-terminal and/or C-terminal
α-carbon reactive group of the protected single copy
polypeptide by transamidation or by segment condensation
reaction. Alternatively, the D-amino acid, L-amino
acid, amino acid derivative or peptides containing a
mixture thereof can replace the N-terminal or C-terminal
amino acid or amino acids of a portion of a side chain
protected recombinant single copy polypeptide.
Typically, a D-amino acid, L-amino acid, amino
acid derivative, or peptide can be added by well known
solution or solid phase peptide synthesis, as described
in Solid Phase Peptide Synthesis, 2nd Edition, J.M.
Stewart and J.D. Young, editors, Pierce Chemical Co.,
Rockford, IL (1984), which is incorporated herein by
reference. One example of such a reaction is adding a
urethane blocked amino acid to the free N-terminal
α-amine of the side chain protected single copy
polypeptide in the presence of carbodiimide, mixed
anhydrides or active esters. The reaction scheme is
represented as follows:

                                carbodiimide
(CH3)3COONH-CHR-COOH + NH2R' -----------------> (CH3)3COONHCHRCONHR'
                               organic solvent

An alternative synthesis is the segment
condensation procedure, which is preferably used when
small peptides are coupled to the N-terminal α-amine
groups as described by F. Finn et al., in The Proteins,
3rd ed., Neurath and Hill, editors, Academic Press, NY,
vol. 2, pp. 105-253 (1976), which is hereby incorporated
by reference.

Replacement of the N-terminal amino acid(s) can
be accomplished by removing the N-terminal amino acid or
amino acids by cleavage with a chemical or enzymatic
cleavage reagent like those provided in Table 1 or with
an amino- or carboxypeptidase. Alternatively, the
recombinantly produced single copy polypeptide can be
produced so that the gene sequence lacks the codons for the
N-terminal or C-terminal amino acid or amino acids. The
protected single copy polypeptide preferably lacking up
to about 10 N-terminal amino acids can be modified by
the addition of a D-amino acid, L-amino acid, amino acid
derivative, or peptide containing a mixture thereof as
described above.
A specific example includes replacement of the
two N-terminal amino acids of ovine β-endorphin with a
dipeptide Tyr-D-Ala. The naturally occurring ovine
β-endorphin has 31 amino acids. The starting material
for the recombinantly produced peptide is a multicopy
polypeptide fusion protein containing multicopies of a
truncated β-endorphin (amino acids 3-31) intraconnected
by arginine.

                                            (1) protect   (2) cleave
BP-Arg-B(3-31)-Arg-B(3-31)-Arg-B(3-31)-Arg -----------> ----------->
                                            maleic        with
                                            anhydride     trypsin

                                        (3) segment condensation
NH2-B(3-31)-Arg + FMOC-Tyr-D-Ala-COOH ------------------------->
                                        carbodiimide

                             (4) deprotect
FMOC-Tyr-D-Ala-B(3-31)-Arg -------------------------> Tyr-D-Ala-B(3-31)
                             (a) pH=2 about 2 hours
                             (b) carboxypeptidase

Key

BP-Arg-B(3-31)-Arg-B(3-31)-Arg-B(3-31)-Arg =
                        multicopy fusion protein
                        composed of binding protein
                        (BP) interconnected by Arg
                        to multiple copies of
                        truncated β-endorphin (B(3-31))
                        intraconnected by arginine

NH2-B(3-31)-Arg       = single copy truncated ovine
                        β-endorphin with C-terminal
                        arginine and unprotected
                        N-terminal α-amine

FMOC-Tyr-D-Ala        = dipeptide protected at
                        N-terminal with FMOC
                        (9-fluorenylmethyloxycarbonyl)

FMOC-Tyr-D-Ala-B(3-31)-Arg = N-terminal modified
                        protected β-endorphin

Tyr-D-Ala-B(3-31)     = N-terminally modified
                        β-endorphin
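
The four steps of this scheme can be traced symbolically in Python. The sketch treats the truncated β-endorphin copy (residues 3-31) as a single opaque token, because its sequence is not given here, and lysine protection is assumed so that trypsin cuts only after the Arg tokens. All function names are hypothetical.

```python
from typing import List

COPY = "B(3-31)"  # opaque token for truncated beta-endorphin


def trypsin(tokens: List[str]) -> List[List[str]]:
    """Split after every Arg token (lysines assumed protected)."""
    fragments, current = [], []
    for token in tokens:
        current.append(token)
        if token == "Arg":
            fragments.append(current)
            current = []
    if current:
        fragments.append(current)
    return fragments


def segment_condensation(fragment: List[str]) -> List[str]:
    """Couple FMOC-Tyr-D-Ala to the free N-terminal alpha-amine."""
    return ["FMOC", "Tyr", "D-Ala"] + fragment


def deprotect(fragment: List[str]) -> List[str]:
    """Drop the FMOC token, then remove the C-terminal Arg."""
    trimmed = [token for token in fragment if token != "FMOC"]
    return trimmed[:-1] if trimmed and trimmed[-1] == "Arg" else trimmed


if __name__ == "__main__":
    fusion = ["BP", "Arg", COPY, "Arg", COPY, "Arg", COPY, "Arg"]
    copies = [f for f in trypsin(fusion) if f[0] == COPY]
    products = [deprotect(segment_condensation(f)) for f in copies]
    print(["-".join(p) for p in products])
```

The binding protein fragment is simply filtered out in this model; in practice it would be separated from the single copy polypeptides before the condensation step.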

Specific examples of the types of modifications
made to biologically active peptides include addition of
L-N-(2-oxopiperidine-6-ylcarbonyl)-L-histidyl-L-thiazolidine-4-carboxamide
to thyroliberin (TRF),
3-methylhistidine to TRF, modified C-terminal
des-Gly(10)-Pro(9)-N-ethylamide to luteinizing releasing
factor (LRF), modified N-terminal of LRF with Ac-D-Phe
and/or pCl-D-Phe, N-terminal pyroglutamyl residue to
litorin, D-alanine at the 2-position of enkephalin,
C-terminal modification adding methioninol sulfoxide at
the C-terminal of enkephalin, and α- and γ-endorphin
amides. Other analogs of biologically active peptides
are described in Kirk-Othmer Chemical Encyclopedia,
12:603-617, which is hereby incorporated by reference.
The preferred modification is the addition of a
D-amino acid at the C-terminal or N-terminal end of the
protected single copy polypeptide.

Specific examples of derivatives of amino acids
that can be added to or replace terminal amino acids
include pyroglutamyl residues, homoserine,
hydroxyproline, 3-methylhistidine, hydroxylysine,
desmosine, N-methylglycine, N-methylisoleucine, and
N-methylvaline.



3. Formation of N-Terminal Acetyl Groups

Naturally occurring polypeptides and analogues
can have N-terminal acetyl groups or N-terminal
oligopeptide prefix sequence or N-terminal synthetic
organic moieties. The modification reaction providing
for N-terminal acetyl groups or N-terminal oligopeptide
prefix sequence or N-terminal synthetic organic moieties
involves reaction of a protected single copy polypeptide
with an unprotected N-terminal α-amine group with acetic
anhydride or oligopeptide prefix or synthetic organic
moiety as follows:

a) NH2CR1COOR2 + (CH3CO)2O ---> CH3CONHCR1COOR2 + CH3COOH

b) NH2CR1COOR2 + (AA)nCHR3COOH ---> (AA)nCHR3CONHCR1COOR2 + H2O

c) NH2CR1COOR2 + R3COOH ---> R3CONHCR1COOR2 + H2O

An example of an analogue that has an
acetylated N-terminal amino acid is an LRF antagonist.
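
One practical way to confirm that a terminal modification has taken place is to compare the expected mass change with a measured mass. The sketch below computes the monoisotopic shifts for C-terminal amidation and N-terminal acetylation from standard atomic masses. Mass analysis is not described in this section, so this is offered only as an illustrative check, and the function names are hypothetical.

```python
# Monoisotopic atomic masses in daltons (standard reference values).
H = 1.007825
C = 12.000000
N = 14.003074
O = 15.994915


def amidation_shift() -> float:
    """-COOH to -CONH2: OH is replaced by NH2 (net +N +H -O)."""
    return N + H - O


def acetylation_shift() -> float:
    """NH2- to CH3CONH-: net gain of C2H2O."""
    return 2 * C + 2 * H + O


if __name__ == "__main__":
    print(round(amidation_shift(), 6))
    print(round(acetylation_shift(), 6))
```

Amidation lowers the mass by just under one dalton and acetylation raises it by about 42 daltons, so the two modifications are easy to tell apart from each other and from the unmodified polypeptide.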



D. Deprotection

The side chain protected modified polypeptide
is then deprotected using a variety of conditions
depending upon the particular protecting group involved.
Deprotection involves removal of the protecting group
and regenerating the original reactive group without
undesirable side reactions. Deprotection conditions do
not adversely affect the N- and/or C-terminal
modification.

The deprotection conditions chosen will depend
on the type of protecting group. For example, amide and
carbamate protecting groups can be removed by incubation
under acidic conditions of a pH ranging from about 1-4.
Other conditions allowing for the removal of the amine
and hydroxyl protecting groups without undesirable side
reactions are described in Protective Groups in Organic
Chemistry, cited supra.

Specific examples of the cleavage of the amine
and hydroxyl protecting groups include the following
reactions:

Cleavage of carbamates:

                    CF3COOH/PhSH
1) RNHCOOC(CH3)3 ----------------> RNH2
                    25°C, 1 hr.

                      piperidine, 25°C
2) RNH-N-9-fluorenyl ----------------> RNH2
                      10 min.

                       pH 2-3
3) RNHCOCH=CHCOOH ------------> RNH2
                       2 hrs.

Carboxyl protecting groups can be removed by
incubation at a high pH of about 8-11. Other conditions
for removal of carboxyl protecting groups without
undesirable side reactions are described in Protective
Groups in Organic Chemistry, cited supra. Specific
examples of the cleavage of carboxyl protecting groups
include the following reactions:

                 CF3COOH
RCOOCH2SCH3 ----------------> RCOOH, 80-90%
              25°C, 15 min.

             KOH/18-crown-6, PhMe
ArCOO-t-Bu ----------------------> ArCOOH
             100°C, 5 hr., 94%

Thiol protecting groups can be removed in the
presence of Na and NH3. Other conditions for removal of
thiol protecting groups are described in Protective
Groups in Organic Chemistry, cited supra.

Specific examples of the cleavage of thiol
protecting groups include the following reactions:

                  Na/NH3
1) CysSCH2C6H5 ----------> CysSH
                  10 min.

               0.2 N NaOH/N2
2) CysSCOCH3 ----------------> CysSH
                  20°C

In addition, the modified side chain protected
polypeptide can also have the intraconnecting peptide
residues at the C- or N-terminal end. If the
intraconnecting residues were not removed at an earlier
point in the reaction scheme, they can be digested and
removed with a cleavage enzyme, like a carboxy- or
aminopeptidase.

If the side chain protected single copy
polypeptide has more than one type of protecting group
present, for example an amine protecting group and a
carboxyl protecting group, deprotection can be conducted
so that the protecting groups are removed sequentially.
For example, the amine and hydroxyl protecting groups
can be removed by incubation at a pH of about 2 for 2
hours. Then the carboxyl protecting groups can be removed
by incubating at a pH of about 8-11 for 2 hours. Other
combinations of deprotection conditions can be utilized
to remove protecting groups from the reactive side
chains to regenerate the original reactive group.
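
The sequential deprotection example above can be expressed as an ordered schedule. This sketch encodes only the two steps given in the example (amine and hydroxyl groups first, then carboxyl groups); the names are hypothetical, and any other group class is rejected and left unscheduled because no ordering for it is stated here.

```python
from typing import Iterable, List, Tuple

# (group class, condition, hours), in the order given in the example.
DEPROTECTION_STEPS: List[Tuple[str, str, float]] = [
    ("amine/hydroxyl", "pH about 2", 2.0),
    ("carboxyl", "pH about 8-11", 2.0),
]


def deprotection_schedule(groups: Iterable[str]) -> List[Tuple[str, str, float]]:
    """Return the steps needed for the given group classes, in order."""
    present = set(groups)
    known = {name for name, _, _ in DEPROTECTION_STEPS}
    unknown = present - known
    if unknown:
        raise ValueError(f"no schedule stated for: {sorted(unknown)}")
    return [step for step in DEPROTECTION_STEPS if step[0] in present]


if __name__ == "__main__":
    schedule = deprotection_schedule(["carboxyl", "amine/hydroxyl"])
    for name, condition, hours in schedule:
        print(f"{name}: {condition}, {hours} h")
    print(sum(hours for _, _, hours in schedule), "h total")
```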

After deprotection, the final product is a
single copy polypeptide with a modified C- and/or
N-terminal amino acid. The final product can be
purified by standard methods including size exclusion,
ion exchange, or affinity chromatography. In a
preferred version, a small peptide like mastoparan can
be purified by size exclusion column or HPLC
chromatography.

The invention has been described with reference
to various specific and preferred embodiments and
techniques. However, it should be understood that many
variations and modifications can be made while remaining
within the spirit and scope of the invention.

EXAMPLE 1

Formation of a C-Terminal α-Amide Polypeptide
from a Recombinant Multicopy Fusion Protein
Having C-terminal Arginine Groups

An expression vector that has a recombinant
gene encoding a multicopy fusion protein is formed by
standard recombinant DNA methodologies. Briefly, the
gene for human carbonic anhydrase is modified by removal
of the nucleotide sequence for the three C-terminal
amino acids. Alternatively, the gene for mini-modified
human carbonic anhydrase is modified by conversion of
methionine 240 to leucine or serine and removal of the
nucleotide sequence for the three C-terminal amino
acids. The gene encoding a multicopy polypeptide
containing three copies of a mastoparan polypeptide
intraconnected by arginine residues and having a
C-terminal arginine is synthesized by automated
techniques. The automated techniques are described
generally by S. Beaucage et al., Tetra. Letters, 22:1859
(1981), which is hereby incorporated by reference. The
synthesis of the multicopy mastoparan polypeptide with
C-terminal arginine (45 amino acids) is conducted using
optimal codon usage for E. coli and results in a
multicopy polypeptide having useful restriction
endonuclease sites. The DNA sequence for the
interconnecting peptide containing enterokinase
recognition sequence (Val-Asp-Asp-Asp-Lys) (SEQ ID NO: 8)
is synthesized by the automated methods as described
above. Alternatively, the interconnecting peptide DNA
sequence can be a methionine glycine asparagine glycine
sequence for use with the mini-modified human carbonic
anhydrase. This gene can be synthesized by the
automated methods as described above.

The gene for human carbonic anhydrase is
inserted in a plasmid downstream from a T7 promoter by
standard methods generally known in the art and
described by Sambrook et al., cited supra. The DNA
sequence for the interconnecting peptide is inserted
downstream from the carbonic anhydrase gene. The gene
encoding a multiple copy of the mastoparan polypeptide
is inserted immediately downstream from the sequence for
the interconnecting peptide.

Typically DNA sequences are inserted by
restriction endonuclease digestion and ligation as
described herein. A 0.5 to 2 μg sample of plasmid DNA
is digested in 20 μl of a 1X restriction buffer with 1
to 20 units of restriction enzyme. The reaction mix is
incubated for 1 to 16 hours at the temperature
recommended by the enzyme supplier. The linearized
vector can then be dephosphorylated with calf intestinal
phosphatase or bacterial alkaline phosphatase under
conditions known to those with skill in the art, e.g.,
as suggested by the supplier. The DNA is then further
purified by standard procedures (see Sambrook et al.,
cited supra), which usually involve a phenol extraction
and ethanol precipitation.

The DNA segment to be inserted is then mixed in
a 3 to 5 fold (for large fragments) or 20 to 30 fold
(for short oligonucleotides) molar excess with the precut
cloning vector. The ligation is performed in a 1X ligation
buffer (20 mM Tris pH 7.6, 10 mM magnesium chloride,
0.4 mM β-mercaptoethanol, 0.4 to 1 mM ATP), in the presence
of T4 DNA ligase at 16°C for 16 hours. The same
procedure is repeated to add DNA segments
successively and the restriction endonucleases are
chosen to selectively place the newly inserted DNA
segments. An aliquot of a reformed vector is then used
to transform competent E. coli cells by calcium chloride
precipitation, and the cells are selected for recombinant
plasmids.
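The molar excess used in the ligation can be converted into a mass of insert DNA. The following is a minimal sketch and not part of the described procedure. It assumes that vector and insert are both double-stranded DNA with the same average mass per base pair, so that the mole ratio depends only on the fragment lengths. The function name and the example quantities (100 ng of a 5000 bp vector, a 500 bp fragment, a 30 bp oligonucleotide duplex) are hypothetical.

```python
def insert_mass_ng(vector_ng, vector_bp, insert_bp, molar_excess):
    """Mass of insert (ng) giving `molar_excess` moles of insert per mole of vector.

    Assumption: both fragments are double-stranded DNA with the same average
    mass per base pair, so moles scale as mass divided by length.
    """
    if min(vector_ng, vector_bp, insert_bp, molar_excess) <= 0:
        raise ValueError("all arguments must be positive")
    return vector_ng * insert_bp * molar_excess / vector_bp


# Hypothetical case 1: large fragment at a 3-fold molar excess.
large_fragment_ng = insert_mass_ng(100, 5000, 500, 3)

# Hypothetical case 2: short oligonucleotide duplex at a 20-fold molar excess.
oligo_ng = insert_mass_ng(100, 5000, 30, 20)

print(large_fragment_ng, oligo_ng)
```

Because short fragments carry far less mass per mole, a 20 to 30 fold molar excess of an oligonucleotide can still correspond to less DNA by mass than a 3 to 5 fold excess of a large fragment.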

Bacteria are transformed with the plasmid DNA.
Luria broth is inoculated with a bacterial culture and
the cells are grown with agitation at optimum
temperature to a density of about 10^ to 10^ cells per
ml. The culture is chilled to about 0°C, centrifuged
and the cells are collected. The cells are then
resuspended in an ice cold sterile solution of 50 mM
calcium chloride and 10 mM Tris chloride (pH 8.0). The
centrifugation and resuspension step is repeated one more
time. The resulting concentrated suspension of
treated cells is ready to accept the new vector.
Typically the new vector contains a selective marker or
reporter gene. Selective marker genes generally encode
antibiotic resistance.

For maximum transformation efficiency the
bacterial culture preferably is in logarithmic phase of
growth; the cell density preferably is low at the time
of treatment with calcium chloride; and the treated
cells are preferably maintained at 40°C for 12 to 24
hours. To take up the vector an aliquot of the ligation
reaction is added to the suspension of treated cells.
The combination is mixed and stored on ice for a short
time. Up to 40 nanograms of DNA (dissolved in up to 100
microliters of ligation buffer or TE) can be used for
each transformation. Next, the transformed cells in their
culture tubes are transferred to a 40°C water bath for 2
minutes. An aliquot of Luria broth is added to each
tube and the cells are incubated at 37°C for about 30
minutes (tetracycline selection) or 1 hour (ampicillin
or kanamycin selection). This period of time allows the
bacteria to recover and to begin to express antibiotic
resistance. The cells are spread onto selective media
and incubated at optimum temperature. Colonies will
appear overnight (adapted from Sambrook et al., cited
supra).

Transformed E. coli are selected through the
use of plates containing the appropriate antibiotic
(i.e., the one to which resistance is conferred by the
introduced plasmid). Typical final concentrations are
ampicillin at 100 micrograms per ml, chloramphenicol at
10 micrograms per ml, kanamycin at 50 micrograms per ml,
streptomycin at 25 micrograms per ml, and tetracycline at
15 micrograms per ml. When using E. coli BL21(DE3) pLysS
as the host, transformants are plated out on a medium
containing both ampicillin and chloramphenicol at the
above concentrations.
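The final concentrations above translate into a volume of antibiotic stock per batch of medium. The sketch below is illustrative only: the final concentrations are those quoted in the text, while the stock concentrations, the 500 ml batch size, and the function name are hypothetical values chosen for the example.

```python
# Final concentrations quoted in the text, in micrograms per ml.
FINAL_UG_PER_ML = {
    "ampicillin": 100,
    "chloramphenicol": 10,
    "kanamycin": 50,
    "streptomycin": 25,
    "tetracycline": 15,
}


def stock_volume_ul(antibiotic, medium_ml, stock_mg_per_ml):
    """Microliters of stock needed to reach the final concentration in `medium_ml`."""
    total_ug = FINAL_UG_PER_ML[antibiotic] * medium_ml
    # 1 mg/ml equals 1 microgram per microliter, so ug / (mg/ml) gives microliters.
    return total_ug / stock_mg_per_ml


# Hypothetical stocks: ampicillin at 50 mg/ml, chloramphenicol at 20 mg/ml,
# each added to a hypothetical 500 ml batch of plating medium.
amp_ul = stock_volume_ul("ampicillin", 500, 50)
cam_ul = stock_volume_ul("chloramphenicol", 500, 20)
print(amp_ul, cam_ul)
```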

In a preferred embodiment the method for
culturing transformed cells can be practiced as
described in Sambrook et al., cited supra. Briefly, the
method entails transferring a single transformed and
selected bacterial colony to a small volume (3 to 5 ml)
of bacterial growth medium (such as Luria broth)
containing an appropriate antibiotic. The culture is
incubated at 37°C (or other appropriate temperature) and
scaled up to large volumes.

Cells are lysed with sonication in 830 ml of
50 mM Tris-HCl (pH 7.9), 0.5 mM EDTA containing 100 mM
sodium chloride with 10 micrograms per ml of DNase I.
Lysozyme (30 milligrams) is added and the lysate is
incubated overnight to disrupt the cell fragments.

To purify recombinant protein from insoluble
granules, the lysate is then centrifuged, incubated with
sodium deoxycholate, and washed several times. The cell
lysate is then frozen and thawed. The cell lysate is
further purified by ultrafiltration and DEAE column
chromatography. The partially purified fusion protein
is then further purified on an affinity column
containing sulfanilamide. The partially purified cell
lysate is pumped through a column of sulfanilamide-
sepharose prepared by conventional methods. The bound
protein is washed with 0.5 M Tris-sulfate, 1 M sodium
sulfate (pH 7.5) to remove other materials. The bound
multicopy fusion protein containing carbonic anhydrase
is eluted with 0.2 M potassium thiocyanate and 0.5 M
Tris-sulfate (pH 7.5).

The purified multicopy fusion protein is
digested with bovine enterokinase in 10 mM Tris buffer
(pH 8.0) at 37°C for 15 hours. The enterokinase
cleaves at the Asp^Lys interconnecting peptide to form
free carbonic anhydrase enzyme and a multicopy fusion
protein with a free α-amine group and a C-terminal
arginine group. The multicopy peptide is purified from
the carbonic anhydrase by ultrafiltration.
Alternatively, the purified multicopy demifusion protein
precursor is treated with cyanogen bromide in Tris
buffer (pH 8.0) to cleave the carbonic anhydrase
sequence. The multicopy demifusion peptide is purified
from the carbonic anhydrase residue by ultrafiltration.

The α-amine, ε-amine and hydroxyl groups
present in the multicopy polypeptide are protected by
reaction of the polypeptide with an amine protecting
agent like maleic anhydride. If the multicopy
demifusion protein is used, the α-amine is already
protected by a biological protecting group and the ε-amine
and hydroxyl groups present in the multicopy demifusion
polypeptide are protected as described above. The
maleic anhydride reacts with amines and forms acidic
amide protecting groups in the presence of 5M GuHCl (pH
8 to 8.5). This reaction is followed by a buffer
exchange by 1K ultrafiltration.

If the multicopy polypeptide contains carboxyl
groups, the β- or γ-carboxyl groups are protected using
an activated alcohol like methanol or ethanol. The
multicopy polypeptide or the multicopy demifusion
polypeptide is then cleaved with trypsin. The trypsin
cleaves only at the intraconnecting arginine residues
and not at the amine protected lysine residues. The
trypsin digestion results in the formation of single
copy polypeptides, some of which have free N-terminal
amine groups or, if the multicopy demifusion polypeptide
is used, all of which have free N-terminal amine groups.

The single copy polypeptides are then digested
with carboxypeptidase B. The carboxypeptidase B cleaves
arginine residues from the C-terminal. If the C-
terminal arginine residues are protected at the α-
carboxyl group, the carboxypeptidase cleaves the ester
protecting group as well as removing the arginine.

The mixture of single copy polypeptides, some
having free α-amine groups, is treated with maleic
anhydride again to protect the free amine groups
generated upon cleavage with trypsin. The fully
protected single copy polypeptides are then exchanged
into a mixture of dimethylformamide and methylene
chloride.

The protected polypeptide has a protected
N-terminal α-amine and an unprotected C-terminal α-
carboxyl group generated upon cleavage of the C-terminal
arginine. The protected polypeptide is reacted with
dicyclohexylcarbodiimide and o-nitrophenol to produce an
active ester at the C-terminal α-carboxyl group. The
activated protected polypeptide is then transferred to
an aqueous solution of ammonia to form the amine protected
C-terminal α-amide polypeptide.

The amine and hydroxyl groups of the protected
α-amidated polypeptide are deprotected by treatment at a
pH of about 2.0 for 2 hours at 20°C. The carboxyl groups
are deprotected by alkaline treatment at a pH of about 8
to 10. The deprotected C-terminal α-amide polypeptide is
purified by size exclusion chromatography.

EXAMPLE 2

Formation of C-terminal α-Amide Polypeptide
from a Recombinant Multicopy Protein

Recombinant multicopy protein is formed as
described in Example 1. The recombinant multicopy
protein has multiple copies of the single copy
polypeptide connected with an intraconnecting peptide.
The recombinant multicopy polypeptide contains three
copies of the myosin light chain kinase inhibitor
intraconnected with glutamic acid. The sequence of the
myosin light chain kinase inhibitor is Lys-Arg-Arg-Trp-
Lys-Lys-Asn-Phe-Ala-Val (SEQ ID NO:9). The DNA sequence
encoding the multicopy protein is synthesized by
automated methods, and cloned downstream from the T7
promoter in an expression vector prepared as described
in Example 1.

The recombinant multicopy protein is expressed
in transformed E. coli having a recombinant expression
vector prepared as described in Example 1. The
recombinant multicopy protein is purified from
transformed cell lysates by affinity chromatography
utilizing an immobilized monoclonal antibody specific
for myosin light chain kinase inhibitor.

The multicopy polypeptide is then cleaved with
Staphylococcus aureus V8 cleavage enzyme at the glutamic
acid to form a mixture of multiple units of single copy
polypeptides. The mixture of single copy polypeptides
also contains polypeptides having unprotected α-amine
groups and side chain amine groups generated by the
enzyme cleavage of the intraconnecting peptide. These
unprotected α-amine groups are protected by reaction
with maleic anhydride to form a fully protected single
copy peptide having C-terminal glutamic acid residues.
The C-terminal glutamic acid residues are removed by
carboxypeptidase at pH 4.5.
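The two enzymatic steps of this example can be followed at the level of the amino acid sequence. The sketch below is an idealized string model, not the laboratory method: it assumes complete and perfectly specific cleavage, ignores the protecting groups, and assumes a layout in which each of the three copies of the inhibitor (SEQ ID NO:9, written in one-letter code) is followed by the glutamic acid intraconnecting residue. The function names are hypothetical.

```python
MLCK_INHIBITOR = "KRRWKKNFAV"  # SEQ ID NO:9 in one-letter code


def cleave_after(sequence, residue):
    """Split `sequence` after every occurrence of `residue` (idealized cleavage)."""
    fragments, start = [], 0
    for i, aa in enumerate(sequence):
        if aa == residue:
            fragments.append(sequence[start:i + 1])
            start = i + 1
    if start < len(sequence):
        fragments.append(sequence[start:])
    return fragments


def trim_c_terminal(fragment, residue):
    """Remove one C-terminal `residue` if present (models the carboxypeptidase step)."""
    return fragment[:-1] if fragment.endswith(residue) else fragment


# Assumed layout: three copies, each followed by a Glu (E) intraconnecting residue.
multicopy = (MLCK_INHIBITOR + "E") * 3

# V8 enzyme step: cleavage on the C-terminal side of each glutamic acid.
after_v8 = cleave_after(multicopy, "E")

# Carboxypeptidase step: removal of the C-terminal glutamic acid.
single_copies = [trim_c_terminal(fragment, "E") for fragment in after_v8]

print(after_v8)
print(single_copies)
```

Because the inhibitor sequence itself contains no glutamic acid, the model yields three identical fragments, each ending in Glu before the carboxypeptidase step and matching SEQ ID NO:9 after it.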

The removal of the C-terminal glutamic acid and
protection of α- and ε-amine groups can be conducted in
either order. The fully protected single copy
polypeptide is amidated by a reaction with
dicyclohexylcarbodiimide in DMF/DCM followed by reaction
with ammonium hydroxide. Amidation occurs selectively
at the α-carboxyl group of the C-terminal amino acid to
form a protected C-terminal α-amide.

The protected C-terminal α-amide of myosin
light chain kinase inhibitor is deprotected at pH 2 for
about 2 hours. The α-amidated myosin light chain kinase
inhibitor is purified by HPLC size exclusion
chromatography.

EXAMPLE 3

Formation of C-terminal α-Amide Polypeptide
from a Recombinant Single Copy Fusion Protein

The recombinant single copy fusion protein is
formed as described in Example 1 except that carbonic
anhydrase is connected by an arginine to a single copy
of a polypeptide wound healing factor. The sequence of
the wound healing factor is Ala-Phe-Ser-Lys-Ala-Phe-Ser-
Lys-Ala-Phe-Ser-Lys-Ala-Phe-Ser-Lys-Ala-Phe-Ser-Lys (SEQ
ID NO:1). The gene encoding the peptide is produced by
automated techniques as described in Example 1 and
combined with the gene for the binding protein and the
interconnecting peptide in an expression vector as
described in Example 1. The single copy fusion protein
is expressed and purified as described in Example 1.

The recombinantly produced fusion protein is
cleaved at the arginine interconnecting peptide with
clostripain to form a single copy polypeptide with an
unprotected α-amine group at the N-terminal.

The single copy polypeptide is reacted with
maleic anhydride in 5M GuHCl (pH 8 to 8.5) to form a
protected single copy polypeptide.

The protected single copy polypeptide is
reacted with water soluble carbodiimide in an excess of
ammonium hydroxide as an amidating agent to form a
protected C-terminal α-amide polypeptide.

The protected C-terminal α-amide polypeptide is
deprotected at pH 2 for about 2 hours, and the C-
terminal α-amidated wound healing factor is purified by
HPLC size exclusion chromatography.

EXAMPLE 4

Selective Modification of the N- and C-terminal
Amino Acid α-Carbon Reactive Groups of a
Recombinant Polypeptide

The recombinant single copy fusion protein is
formed as described in Example 3. The single copy
fusion protein contains carbonic anhydrase as the
binding protein (N-terminal α-amine protecting group)
interconnected via the thrombin recognition peptide
(Arg-Gly-Pro-Arg) (SEQ ID NO:4) to the wound healing
factor with an additional C-terminal arginine residue
(C-terminal α-carboxyl protecting group). The single
copy polypeptide is protected at both the N- and
C-terminal α-carbon reactive groups. The recombinant
single copy fusion protein is expressed in a transformed
host and purified as described in Example 1.

The recombinant single copy fusion protein is
reacted with maleic anhydride in 5M GuHCl (pH 8 to 8.5) to
form a protected single copy polypeptide. The maleic
anhydride protects the side chain groups of serine and
lysine.
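To illustrate which groups the protecting agent has to cover, the sketch below counts the reactive side chain groups in the wound healing factor sequence of SEQ ID NO:1. The mapping from residue to group type follows the side chain groups named in this description (ε-amine, hydroxyl, β-carboxyl, γ-carboxyl, thiol). Treating threonine and tyrosine as hydroxyl-bearing residues is an assumption of the sketch, the additional C-terminal arginine of this example is not included, and all names are hypothetical.

```python
from collections import Counter

# One-letter residue code mapped to the reactive side chain group it carries.
# Thr (T) and Tyr (Y) are included as hydroxyl-bearing residues by assumption.
SIDE_CHAIN_GROUPS = {
    "K": "epsilon-amine",
    "S": "hydroxyl",
    "T": "hydroxyl",
    "Y": "hydroxyl",
    "D": "beta-carboxyl",
    "E": "gamma-carboxyl",
    "C": "thiol",
}


def reactive_side_chains(sequence):
    """Count reactive side chain groups in a one-letter peptide sequence."""
    return Counter(
        SIDE_CHAIN_GROUPS[aa] for aa in sequence if aa in SIDE_CHAIN_GROUPS
    )


# SEQ ID NO:1 is five repeats of Ala-Phe-Ser-Lys.
WOUND_HEALING_FACTOR = "AFSK" * 5

counts = reactive_side_chains(WOUND_HEALING_FACTOR)
print(dict(counts))
```

For this sequence the count is five serine hydroxyl groups and five lysine ε-amine groups, with no side chain carboxyl or thiol groups, which is consistent with maleic anhydride being the only side chain protecting agent used in this example.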

The protected single copy fusion protein is
then cleaved with thrombin. The thrombin cleaves at the
interconnecting peptide to form a protected polypeptide
having an unprotected N-terminal α-amine group.

The protected polypeptide with the unprotected
N-terminal α-amine group is reacted with a first
modifying agent, a pyroglutamyl amino acid, in the
presence of carbodiimide to form an amide bond between
the N-terminal amino acid and a pyroglutamyl residue.
The reaction is conducted in an organic solvent like DMF
to provide solubility of pyroglutamyl and carbodiimide.
The protected single copy polypeptide is now modified
selectively at the N-terminal α-amine reactive group.

The C-terminal arginine is then cleaved off
with carboxypeptidase B to form a protected single copy
polypeptide modified at the N-terminal α-amine and
having an unprotected C-terminal α-carboxyl group. The
unprotected C-terminal α-carboxyl group is reacted with
a water soluble carbodiimide and excess ammonium
hydroxide to form a protected single copy polypeptide
with a modified N-terminal α-amine and a C-terminal α-
carboxyl amide.

The protected single copy polypeptide with the
C-terminal α-amide and the N-terminal α-amine
pyroglutamyl residue is deprotected in an acidic
solution at pH 2 for two hours. After deprotection,
the final product is a wound healing factor peptide
modified at the C-terminal α-carboxyl by amidation, and
modified at the N-terminal α-amine with an additional
pyroglutamyl residue.

EXAMPLE 5

Replacement of N-terminal Amino Acids of
Bradykinin Derived from a Multicopy Fusion Protein

The starting material is a multicopy fusion
protein containing three copies of a truncated
bradykinin peptide interconnected by Asn-Gly to a mini-
modified carbonic anhydrase (Leu Ser 240). The mini-
modified carbonic anhydrase gene is obtained and
subcloned into the base vector downstream of a T7
promoter as described in Example 1. The gene for the
multicopy polypeptide is synthesized by automated
synthesis and includes three copies of the coding
sequence for amino acid residues 4-9 of bradykinin
tandemly linked with the coding sequence for Met Gly Asn
interconnected to the N-terminal of the multicopy
polypeptide as follows (SEQ ID NO: 10):

Met-Gly-Asn-Gly-Phe-Ser-Pro-Phe-Arg-
Gly-Phe-Ser-Pro-Phe-Arg-Gly-Phe-Ser-
Pro-Phe-Arg

The Met-Gly-Asn serves as interconnecting peptide
cleavable by cyanogen bromide and hydroxylamine. No
intraconnecting peptide is necessary as trypsin will
cleave at the C-terminal arginine. The gene encoding
the multicopy polypeptide with interconnecting peptide
is cloned downstream from the mini-modified carbonic
anhydrase as described in Example 1. The vector
containing the gene sequence for the recombinant
multicopy fusion protein is introduced into a host
organism as described in Example 1. The recombinant
multicopy fusion protein is expressed and purified, as
described in Example 1.

The purified multicopy fusion protein is
cleaved with 1m cyanogen bromide, pH 8 at 37°C to remove
the carbonic anhydrase fragment and form a biologically
protected demifusion protein. After capture of the
cleaved carbonic anhydrase with a sulfanilamide column,
the serine hydroxyl groups of the separated demifusion
protein can be protected by reaction with maleic
anhydride. The biological protecting group can be
cleaved with 2M hydroxylamine in 5M GuHCl, pH 8.0 at
37°C to form a multicopy polypeptide. The multicopy
polypeptide is cleaved with trypsin to form a truncated
single copy polypeptide with unprotected N-terminal
α-amine reactive groups.
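The order of the three cleavage steps can be traced on the sequence of SEQ ID NO:10. The sketch below is an idealized string model under stated assumptions: cyanogen bromide is taken to cut after methionine, hydroxylamine to cut the Asn-Gly bond, and trypsin to cut after arginine, each completely and without side reactions. The upstream carbonic anhydrase and the side chain protecting groups are left out, and the function name is hypothetical.

```python
MULTICOPY = "MGNGFSPFRGFSPFRGFSPFR"  # SEQ ID NO:10 in one-letter code


def cleave(sequence, residue, next_residue=None):
    """Cut after each `residue`; if `next_residue` is given, only before that residue."""
    fragments, start = [], 0
    for i in range(len(sequence) - 1):
        if sequence[i] == residue and next_residue in (None, sequence[i + 1]):
            fragments.append(sequence[start:i + 1])
            start = i + 1
    fragments.append(sequence[start:])
    return fragments


# Cyanogen bromide: cut after Met. The Met stays with the (omitted) carbonic
# anhydrase; the remainder is the biologically protected demifusion protein.
_, demifusion = cleave(MULTICOPY, "M")

# Hydroxylamine: cut the Asn-Gly bond, releasing the Gly-Asn protecting group.
protecting_group, multicopy_polypeptide = cleave(demifusion, "N", "G")

# Trypsin: cut after each Arg to give the truncated single copy polypeptides.
single_copies = cleave(multicopy_polypeptide, "R")

print(demifusion)
print(protecting_group, multicopy_polypeptide)
print(single_copies)
```

In this model each single copy is Gly-Phe-Ser-Pro-Phe-Arg, that is, residues 4-9 of bradykinin, with a free N-terminal glycine available for coupling to the synthetic tripeptide.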

The first three amino acids of bradykinin
containing a hydroxyproline residue (Hyp) are
synthesized by solid phase or solution chemistry. The
Arg-Pro-Hyp peptide is synthesized by first forming the
9-fluorenylmethyloxycarbonyl (FMOC) hydroxyproline
O-benzyl ether derivative (FMOC derivative). The FMOC
hydroxyproline derivative is reacted with the hydroxide
resin to produce FMOC-Hyp-resin. The FMOC is removed
with piperidine and DCM (dichloromethane). A
dicyclohexylcarbodiimide activated FMOC-proline
derivative is then reacted with the resin bound NH2-Hyp.
The cycle is repeated for FMOC-Arg-(methoxy-2,3,6-
trimethylbenzene sulfonyl). The protected peptide is
then cleaved from the resin with 25% trifluoroacetic
acid in dichloromethane.

The protected N-terminal tripeptide,
Arg-(methoxy-2,3,6-trimethylbenzene sulfonyl)-Pro-Hyp-
COOH, is activated with dicyclohexylcarbodiimide in
dichloromethane and dimethylformamide. The activated
peptide is then reacted with a twofold excess of
recombinantly produced truncated bradykinin (amino acid
residues 4-9) to produce Hyp-3-bradykinin. Excess
recombinantly produced bradykinin (amino acids 4-9) can
be recovered and used again.

EXAMPLE 6

Formation of N- and C-Terminally Modified
Growth Hormone Releasing Factor (GRF)
Derived From a Multicopy Fusion Protein

The starting material is a multicopy fusion
protein containing two copies of growth hormone
releasing factor intraconnected to form a multicopy
polypeptide connected to carbonic anhydrase. The
interconnecting peptide and intraconnecting peptide are
the same and contain a recognition sequence for an
enzymatic cleavage reagent and a recognition sequence
for a chemical cleavage reagent. The sequence (SEQ ID
NO: 11) of the inter- and intraconnecting peptide is:

Asn(A)-Gly-Pro-Arg(B)

A = hydroxylamine cleavage site
B = thrombin cleavage site

The gene sequence for the carbonic anhydrase is
obtained and subcloned into the base vector downstream
of the T7 promoter, as described in Example 1. The gene
sequence for growth releasing factor containing the
inter- or intraconnecting peptide at the N-terminal end
is synthesized by automated oligonucleotide synthesis.
The gene sequence with the interconnecting peptide is
subcloned immediately downstream from the carbonic
anhydrase gene. The gene sequence with the
intraconnecting peptide is subcloned immediately
downstream from the first copy of the growth releasing
factor gene. The vector is then introduced into a
bacterial host and expression of the recombinant
multicopy fusion protein is induced as described in
Example 1. The recombinant multicopy fusion protein is
purified as described in Example 1.

The recombinant multicopy fusion protein is
then cleaved with hydroxylamine. Hydroxylamine cleaves
at the Asn-Gly recognition sequence in the inter- and
intraconnecting peptides to form single copy
polypeptides with N-terminal Gly-Pro-Arg peptide and a
C-terminal Asn residue.

The single copy polypeptide is then reacted
with maleic anhydride to protect ε-amine and hydroxyl
groups. The β- and γ-carboxyl groups are protected by
formation of o-nitrophenol esters at those groups.

The single copy polypeptide is then cleaved
with carboxypeptidase to remove the C-terminal Asn
residue. The unprotected C-terminal α-carboxyl group is
amidated by the reaction of the protected single copy
polypeptide with dicyclohexylcarbodiimide followed by an
excess of ammonia.

The single copy polypeptide is then cleaved
with thrombin to remove the N-terminal biological
protecting group, Gly-Pro-Arg. The unprotected
N-terminal α-amine is then reacted with a urethane
blocked pyroglutamyl residue to form a protected
N-terminally modified, C-terminally modified single copy
polypeptide. The terminally modified single copy
polypeptide is deprotected at about pH 2 for 2 hours,
followed by deprotection at pH 9 for about 2 hours. The
final product is growth releasing factor modified at the
N-terminal with a pyroglutamyl residue and modified at
the C-terminal by amidation.

SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Stout, Jay
Wagner, Fred W.
Coolidge, Thomas R.
Holmquist, Barton

(ii) TITLE OF INVENTION: METHOD FOR MODIFICATION OF
RECOMBINANT POLYPEPTIDES

(iii) NUMBER OF SEQUENCES: 15

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Merchant & Gould 

(B) STREET: 3100 Norwest Center 

(C) CITY: Minneapolis 

(D) STATE: MN

(E) COUNTRY: USA 

(F) ZIP: 55402 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Nelson, Albin J. 

(B) REGISTRATION NUMBER: 28,650 

(C) REFERENCE/DOCKET NUMBER: 8648.35-WO-01

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 612-332-5300 

(B) TELEFAX: 612-332-9081 

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Ala Phe Ser Lys Ala Phe Ser Lys Ala Phe Ser Lys Ala Phe Ser Lys
1               5                   10                  15

Ala Phe Ser Lys
            20



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Asp Asp Asp Asp Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Ile Glu Gly Arg
1



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Arg Gly Pro Arg 
1 

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

His Pro Phe His Leu Leu Val Tyr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Phe Val Asp Asp Asp Asp Lys Phe Val Asn Gly Pro Arg Ala Met Phe
1               5                   10                  15

Val Asp Asp Asp Asp Lys Val Asn Gly Pro Arg Ala Met Ala
            20                  25                  30

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Ile Asn Leu Lys Ala Leu Ala Ala Ala Leu Ala Lys Lys Ile Leu
1               5                   10                  15



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Val Asp Asp Asp Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Lys Arg Arg Trp Lys Lys Asn Phe Ala Val 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Gly Asn Gly Phe Ser Pro Phe Arg Gly Phe Ser Pro Phe Arg
1               5                   10                  15

Gly Phe Ser Pro Phe Arg
                20

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Asn Gly Pro Arg 
1 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GACGACGACG ATAAA 15 

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

ATTGAAGGAA GA



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

AGAGGACCAA GA



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

CATCCTTTTC ATCTGCTGGT TTAT
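As a consistency check on the listing, the four DNA sequences (SEQ ID NO: 12 to 15) can be translated with the standard genetic code and compared with the peptide sequences of SEQ ID NO: 2 to 5. The sketch below is illustrative: the pairing of each DNA entry with a peptide entry is an inference from the listing rather than something the text states, the DNA strings are transcribed with G assumed wherever the scan is ambiguous, and the variable and function names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string (length a multiple of 3) into one-letter amino acids."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# DNA entries of the listing, keyed by SEQ ID NO (transcription assumed as noted).
DNA_ENTRIES = {
    12: "GACGACGACGATAAA",
    13: "ATTGAAGGAAGA",
    14: "AGAGGACCAAGA",
    15: "CATCCTTTTCATCTGCTGGTTTAT",
}

# Peptide entries presumed to correspond, in one-letter code.
PEPTIDE_ENTRIES = {
    12: "DDDDK",     # SEQ ID NO: 2, Asp Asp Asp Asp Lys
    13: "IEGR",      # SEQ ID NO: 3, Ile Glu Gly Arg
    14: "RGPR",      # SEQ ID NO: 4, Arg Gly Pro Arg
    15: "HPFHLLVY",  # SEQ ID NO: 5, His Pro Phe His Leu Leu Val Tyr
}

translations = {seq_id: translate(dna) for seq_id, dna in DNA_ENTRIES.items()}
for seq_id, peptide in translations.items():
    print(seq_id, peptide, peptide == PEPTIDE_ENTRIES[seq_id])
```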

WHAT IS CLAIMED IS:

1. A method for preparing a recombinant single copy
polypeptide or a portion thereof with a modified
terminal amino acid α-carbon reactive group selected
from the group consisting of N-terminal α-amine,
C-terminal α-carboxyl and a combination thereof, and
reactive side chain groups selected from the group
consisting of an ε-amine group, a hydroxyl group, a
β-carboxyl group, a γ-carboxyl group, a thiol group,
and a combination thereof, comprising:

forming the recombinant single copy
polypeptide or a portion thereof so that the single
copy polypeptide is protected with one or more
biologically added protecting groups on the terminal
amino acid α-carbon reactive group selected from the
group consisting of N-terminal α-amine, C-terminal
α-carboxyl and a combination thereof;

conducting the following reacting and
cleaving steps in any order to produce a side chain
protected single copy polypeptide having at least
one unprotected terminal amino acid α-carbon
reactive group;

reacting the recombinant single copy
polypeptide with up to three chemical protecting
agents to selectively protect a reactive side chain
group selected from the group consisting of ε-amine,
hydroxyl, β-carboxyl, γ-carboxyl, thiol, and a
combination thereof;

cleaving the recombinant single copy
polypeptide with at least one cleavage reagent
specific for the biologically added protecting group
to form an unprotected terminal amino acid α-carbon
reactive group;

modifying the unprotected terminal amino
acid α-carbon reactive group with at least one
chemical modifying agent to form a terminally
modified side chain protected single copy
polypeptide; and

deprotecting the terminally modified side
chain protected single copy polypeptide to form the
terminally modified single copy polypeptide.

2. The method according to claim 1, wherein

the recombinant single copy polypeptide is
formed with a biologically added protecting group at
the N-terminal α-amine reactive side chain group by
an amide bond connection to a different polypeptide,
the different polypeptide comprising either an
interconnecting peptide or a binding protein
connected to an interconnecting peptide and the
interconnecting peptide having at least one site
cleavable by a chemical or enzymatic reagent and
being the amide bond connection to the recombinant
single copy polypeptide, and

the cleavage reagent specific for the
biologically added protecting group is an enzyme or
chemical that cleaves at the interconnecting
peptide, and

the chemical modifying agent acts to form
an acetyl group at the N-terminal α-amine group.

3. The method according to claim 1, further comprising: 

forming a recombinant single copy polypeptide so that the
single copy polypeptide is protected by a first biologically
added protecting group at the N-terminal α-amine group by an
amide bond connected to a different polypeptide, the
different polypeptide being either an interconnecting
peptide or a binding protein connected to an interconnecting
peptide and the interconnecting peptide being the amide bond
connection, and the single copy polypeptide is protected by
a second biologically added protecting group at the
C-terminal α-carboxyl group by an amide bond connection to
an arginine;

cleaving the recombinant single copy polypeptide with the
cleavage reagent specific for the first biologically added
protecting group to form a side chain protected single copy
polypeptide having an unprotected N-terminal α-amine;

modifying the unprotected N-terminal α-amine with the first
chemical modifying agent to form an N-terminal α-amine
modified side chain protected single copy polypeptide;

cleaving the N-terminal α-amine modified side chain
protected single copy polypeptide with a second cleavage
reagent specific for the second biologically added
protecting group to form an N-terminal α-amine modified side
chain protected single copy polypeptide with an unprotected
C-terminal α-carboxyl group; and

modifying the unprotected C-terminal α-carboxyl group with a
second modifying agent to form an N-terminal and C-terminal
modified side chain protected single copy polypeptide.

4. A method for the preparation of a recombinant single copy
polypeptide or a portion thereof with a modified C-terminal
α-carboxyl group, an N-terminal α-amine group, and reactive
side chain groups selected from the group consisting of an
ε-amine group, a hydroxyl group, a β-carboxyl group, a
γ-carboxyl group, a thiol group, and a combination thereof,
which is obtained from expression and purification of a
recombinant single copy fusion protein of three tandem
coupled segments, the first segment being a binding protein,
the second segment being an interconnecting peptide and the
third segment being the single copy polypeptide or a portion
thereof, comprising:

cleaving the recombinant single copy fusion protein with a
cleavage reagent specific for the interconnecting peptide to
produce a single copy polypeptide or a portion thereof;

reacting the single copy polypeptide or a portion thereof
with up to three chemical protecting agents to selectively
protect a reactive side chain group selected from the group
consisting of ε-amine, hydroxyl, β-carboxyl, γ-carboxyl,
thiol, and a combination thereof, to form a side chain
protected single copy polypeptide with an unprotected
C-terminal α-carboxyl group;

modifying the C-terminal α-carboxyl group of the side chain
protected single copy polypeptide with a chemical modifying
agent to form a C-terminal α-carboxyl modified side chain
protected single copy polypeptide; and

deprotecting the C-terminal α-carboxyl modified side chain
protected single copy polypeptide to form the C-terminal
α-carboxyl modified recombinant single copy polypeptide.

5. A method for preparing a modified recombinant single copy
polypeptide or a portion thereof with a modified terminal
amino acid α-carbon reactive group selected from the group
consisting of N-terminal α-amine, C-terminal α-carboxyl and
a combination thereof, and reactive side chain groups
selected from the group consisting of an ε-amine group, a
hydroxyl group, a β-carboxyl group, a γ-carboxyl group, and
a combination thereof comprising:

forming the recombinant single copy polypeptide or a portion
thereof so that the single copy polypeptide is protected
with one or more biologically added protecting groups on the
terminal amino acid α-carbon reactive group selected from
the group consisting of N-terminal α-amine, C-terminal
α-carboxyl, and a combination thereof;

cleaving the recombinant single copy polypeptide or a
portion thereof with at least one cleavage reagent specific
for the biologically added protecting group to form a
recombinant single copy polypeptide with an unprotected
terminal amino acid α-carbon reactive group; and

modifying the unprotected terminal amino acid α-carbon
reactive group with at least one chemical modifying agent to
form a terminally modified single copy polypeptide.



6. A method according to claim 1, wherein in the step of
forming a recombinant single copy polypeptide or a portion
thereof, a portion of the recombinant single copy
polypeptide lacks about 1 to about 10 terminal amino acids.

7. The method according to claim 1, wherein the chemical
protecting agent is an agent that selectively protects amine
and hydroxyl groups selected from the group consisting of
alkyl substituted anhydrides, aryl substituted anhydrides,
alkoxy substituted anhydrides, diazo compounds, cyclic
anhydrides, alkyl substituted carbamating agents, and aryl
substituted carbamating agents.



8. The method of claim 1, wherein in the step of reacting
the single copy polypeptide with up to three protecting
agents, the single copy polypeptide is reacted with a
protecting agent that selectively protects amine groups and
is then reacted with a protecting agent that selectively
protects carboxyl groups.
9. The method according to claim 1, wherein the step of
deprotecting the terminally modified single copy polypeptide
comprises incubating the protected single copy polypeptide
at a pH of about 2-4 until substantially all of the
protecting groups are removed.



10. A biologically added protecting group for a 

recombinant single copy polypeptide comprising: a 
peptide or amino acid that contains at least one 
recognition sequence for a cleavage reagent. 



11. A biologically added protecting group according to
claim 10, wherein the peptide contains more than one
cleavage recognition sequence.

12. A biologically added protecting group according to
claim 11, wherein the peptide contains an enzymatic
cleavage recognition site and a chemical cleavage
recognition site.

13. A biologically added protecting group according to 
claim 10, wherein the recognition sequence is not 
present in the single copy polypeptide. 
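
The requirement that the recognition sequence be absent from the single copy polypeptide amounts to a simple sequence check. The sketch below is illustrative only: the function name `recognition_site_absent` is hypothetical, both target sequences are arbitrary one-letter-code strings invented for the example, and `DDDDK` (the enterokinase recognition motif) is used merely as a familiar example of a cleavage site, not as a sequence taken from this document.

```python
def recognition_site_absent(single_copy_polypeptide: str, recognition_site: str) -> bool:
    """Return True if the cleavage recognition site does not occur in the target.

    Sequences are plain one-letter amino acid strings. A result of True means a
    reagent specific for the site would be expected to leave the target intact.
    """
    return recognition_site.upper() not in single_copy_polypeptide.upper()


# Illustrative, made-up sequences (assumptions, not data from the source).
ENTEROKINASE_SITE = "DDDDK"
safe_target = "ACDEFGHIK"      # does not contain the site
unsafe_target = "ACDDDDKGH"    # contains the site, so it would be cut internally

print(recognition_site_absent(safe_target, ENTEROKINASE_SITE))
print(recognition_site_absent(unsafe_target, ENTEROKINASE_SITE))
```

A check of this kind covers only exact matches; a real cleavage reagent may tolerate variant sites, which this sketch does not model.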


14. A method for preparing a recombinant single copy
polypeptide or a portion thereof with a modified terminal
amino acid α-carbon reactive group selected from the group
consisting of N-terminal α-amine, C-terminal α-carboxyl and
a combination thereof, and reactive side chain groups
selected from the group consisting of an ε-amine group, a
hydroxyl group, a β-carboxyl group, a γ-carboxyl group, a
thiol group, and a combination thereof, the recombinant
single copy polypeptide being obtained from expression and
purification of a recombinant multicopy polypeptide,




the recombinant multicopy polypeptide having multiple copies
of the single copy polypeptide connected by an
intraconnecting peptide, comprising:

forming the recombinant multicopy polypeptide so that it is
protected with one or more biologically added protecting
groups on a terminal amino acid α-carbon reactive group
selected from the group consisting of N-terminal α-amine,
C-terminal α-carboxyl, and a combination thereof;

conducting the following reacting and cleaving steps in any
order to produce a side chain protected single copy
polypeptide having at least one unprotected terminal amino
acid α-carbon reactive group;

reacting the recombinant single or multicopy polypeptide
with up to three chemical protecting agents to selectively
protect a reactive side chain group selected from the group
consisting of ε-amine, hydroxyl, β-carboxyl, γ-carboxyl,
thiol, and a combination thereof;

cleaving the recombinant multicopy polypeptide with at least
one cleavage reagent specific for the biologically added
protecting group to form an unprotected terminal amino acid
α-carbon reactive group;

modifying the unprotected N-terminal α-amine or C-terminal
α-carboxyl group with a chemical modifying agent to form a
modified N-terminal α-amine or C-terminal α-carboxyl side
chain protected single copy polypeptide; and

deprotecting the N-terminal α-amine modified or C-terminal
α-carboxyl modified side chain protected single copy
polypeptide to form a terminally modified single copy
polypeptide.



15. A method according to claim 14, further comprising
reacting the recombinant single copy polypeptide with up to
three chemical protecting agents to form a side chain
protected single copy polypeptide with the step of cleaving
the recombinant multicopy polypeptide with a cleavage
reagent specific for the intraconnecting peptide to form a
single copy polypeptide.

16. A method according to claim 15, further comprising:

forming the recombinant multicopy polypeptide with a
biologically added protecting group comprising: two tandemly
coupled segments, the first segment being a binding protein
and the second segment being an interconnecting peptide, or
a single tandemly coupled segment which is the
interconnecting peptide, the interconnecting peptide being
connected to the terminal amino acid α-carbon reactive group
of the multicopy polypeptide; and

cleaving the recombinant multicopy polypeptide with a
cleavage reagent specific for the interconnecting peptide to
form a recombinant multicopy polypeptide.

17. A method according to claim 16, further comprising:

forming the recombinant multicopy polypeptide so that it is
protected with an N-terminal biologically added protecting
group and a C-terminal biologically added protecting group;

cleaving the recombinant multicopy polypeptide with a
cleavage reagent specific for the N-terminal biologically
added protecting group to form a side chain protected single
copy polypeptide with an unprotected N-terminal α-amine
reactive group;

modifying the unprotected N-terminal α-amine with a first
modifying agent to form a modified N-terminal α-amine side
chain protected single copy polypeptide;

cleaving the N-terminal α-amine modified side chain
protected single copy polypeptide with a cleavage reagent
specific for the C-terminal biologically added protecting
group to form a modified N-terminal α-amine side chain
protected single copy polypeptide with an unprotected
C-terminal α-carbon reactive group; and

modifying the C-terminal α-carbon reactive group to form a
modified N-terminal α-amine, modified C-terminal α-carboxyl
side chain protected single copy polypeptide.

18. A method according to claim 17, wherein the N-terminal
biologically added protecting group is two segments tandemly
coupled together, the first segment being a binding protein,
the second segment being an interconnecting peptide, or is
one segment which is the interconnecting peptide, and the
interconnecting peptide being connected to the N-terminal
amino acid of the multicopy polypeptide and the C-terminal
biologically added protecting group is arginine.

19. A method for preparing a recombinant single copy
polypeptide or portion thereof with a modified terminal
amino acid α-carbon selected from the group consisting of
N-terminal α-amine, C-terminal α-carboxyl and a combination
thereof, and reactive side chain group consisting of an
ε-amine group, a hydroxyl group, a β-carboxyl group, a
γ-carboxyl group, and a thiol group, and a combination
thereof, the recombinant single copy polypeptide or a
portion thereof being obtained from expression and
purification of recombinant multicopy polypeptide, the
recombinant multicopy polypeptide having multiple copies of
the single copy polypeptide or a portion thereof tandemly
linked together, comprising:

forming the recombinant multicopy polypeptide so that it is
protected with one or more biologically added protecting
groups on a terminal amino acid α-carbon reactive group
selected from the group consisting of N-terminal α-amine,
C-terminal α-carboxyl and a combination thereof;

cleaving the recombinant multicopy polypeptide with at least
one cleavage reagent that forms single copy polypeptides
with an unprotected terminal amino acid α-carbon reactive
group; and

modifying the unprotected terminal α-carbon reactive group
with a chemical modifying agent to form a terminally
modified single copy polypeptide.
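
The cleaving step of the method above, which turns a tandemly linked multicopy polypeptide into single copies, can be pictured as splitting a sequence string at each linker. This is a deliberately simplified sketch: the function name `release_single_copies` and the sequences are hypothetical, and the model assumes the linker is removed entirely, whereas a real cleavage reagent cuts at a defined position relative to its site and may leave linker residues on the product.

```python
def release_single_copies(multicopy: str, linker: str) -> list:
    """Split a tandem multicopy sequence at every occurrence of the linker.

    Empty fragments (for example from a leading or trailing linker) are dropped.
    """
    if not linker:
        raise ValueError("linker must be a non-empty sequence")
    return [fragment for fragment in multicopy.split(linker) if fragment]


# Illustrative, made-up one-letter sequences (assumptions, not source data).
single_copy = "ACEFGHIL"
linker = "NGR"
multicopy = linker.join([single_copy] * 3)

copies = release_single_copies(multicopy, linker)
print(len(copies), copies[0])
```

The split is only meaningful if the linker sequence does not also occur inside the single copy sequence, which is the same uniqueness condition recited for the recognition sequence of a biologically added protecting group.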

20. The method according to claim 19, wherein the chemical
protecting agent is an agent that selectively protects amine
and hydroxyl groups selected from the group consisting of
alkyl substituted anhydrides, aryl substituted anhydrides,
alkoxy substituted anhydrides, diazo compounds, cyclic
anhydrides, alkyl substituted carbamating agents, and aryl
substituted carbamating agents.

21. The method of claim 19, wherein in the step of reacting
the single copy polypeptide with up to three protecting
agents, the single or multicopy polypeptide is reacted with
a protecting agent that selectively protects amine groups
and is then reacted with a protecting agent that selectively
protects carboxyl groups.



22. The method of claim 19, wherein the chemical modifying
agent is an amidating agent selected from




the group consisting of carbodiimides and ammonia, acid
chlorides and ammonia, mixed anhydrides and ammonia, azides
and ammonia, o-nitrophenol esters and ammonia, and
1-hydroxybenzotriazole esters and ammonia.



23. The method according to claim 19, wherein the step of
deprotecting the terminally modified single copy polypeptide
comprises incubating the protected single copy polypeptide
at a pH of about 2-4 until substantially all of the
protecting groups are removed.



24. The method according to claim 2, wherein the
interconnecting peptide contains at least two cleavage sites
and the cleavage site nearest the binding protein is cleaved
to form the biologically added protecting group from the
interconnecting peptide residue.

25. A method according to claim 24, wherein the
interconnecting peptide contains at least two cleavage sites
and the cleavage site nearest the binding protein is cleaved
to form the biologically added protecting group from the
interconnecting peptide residue.